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Description 



5 The present invention relates to new starch biosynthesis enhancing proteins, nucleic 
acids encoding a starch biosynthesis enhancing protein, a method for producing amy- 
lose with high efficiency by culturing genetically modified plants with an increased 
amylose biosynthesis compared to the wild type or to the genetically modified plants 
themselves as well as the use of these transgenic plants over-expressing at least one 
10 of the starch biosynthesis enhancing proteins for the production of amylose. 

Starch is the major storage carbohydrate of plants and is mainly accumulated in seeds 
and tubers, which are then the reproductive tissues of plants that form those types of 
organs. Starch is also accumulated on a diurnal basis where starch is built up in green 
15 tissue from photosynthetic products and then metabolised for energy during the dark 
period. The storage starch is assembled into semi crystalline granules. Amylopectin 
and.amylose are the two constituent molecules of starch. Amylopectin is a branched 
molecule consisting of linear a-1,4 glucan chains linked by a-1,6 bonds. Amylose con- 
sists essentially of the linear a-1 ,4 glucan chains. 

20 

Starch is utilised for many applications within the technical industry as well as the food 
industry. Main crops used by starch processors are maize and potato. For potato spe- 
cific varieties are utilised for starch production that have been bred for high starch 
contents. This means that the starch content and yield is an important economic driver 

25 for the starch processing industry. A greater part of produced dry starch is used for 

paper production. The specifications and requirements for the starch component varies 
from application to application and starch is many times chemically modified in order to 
provide desired properties to an application. Another way to achieve starch of different 
qualities is to take advantage of mutations in the starch biosynthesis and more recently 

30 by genetic modification of pathways leading to starch. The first main modifications have 
been to separate the production of the two starch components amylopectin and amy- 
lose into different varieties. Waxy or "amylose free" varieties contain solely amylopectin 
type starch while there are also high amylose genotypes such as "amylose extender" in 
maize. 



Amylose starch has several potential industrial uses as a film former or for expanded 
products. High amylose starch can be achieved in potatoes and other starch containing 
plants by inhibition of starch branching enzymes. This leads then to the concomitant 
reduction or elimination of amylopectin branching and thereby an increased amylose 



35 
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fraction. 
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US 5,856,467 describes the genetically engineered modification of potato for suppress- 
ing formation of amylopectin-type starch. The document describes an antisense con- 
struct for inhibiting, to a varying extent, the expression of the gene coding for formation 
of starch branching enzyme ( SBE gene) in potato, said antisense construct comprising 
5 a tuber specific promoter, transcription start and the first exon of the SBE gene, in- 
serted in the antisense direction. 

US 6,169,226 relates to an amino acid sequence of a second starch branching enzyme 
( SBE II) of potato and a fragment thereof as well as to the corresponding isolated DNA 
1 0 sequences. It describes the production of transgenic potatoes and the use of these 
transgenic potatoes for the production of amylose-type starch. 

WO 97/20040 and WO 98/20145 describe methods of altering the amylopectine/ 
amylose starch content of plant cells by introducing into the plant cells nucleic acid 
1 5 sequences operably linked in sense or antisense orientation to a suitable promoter 
which homologous genes encodes polypeptides having SBE I or SBE II activity. 

A side effect of the amylose overproduction is a decreased total starch content in 
the potatoes. This decrease becomes more pronounced as the amylose fraction is 
20 increased. 

Basic enzymes for the production of amylopectin and amylose are starch synthases 
that build the linear a-1, 4 glucan chains and branching enzymes breaking the a-1,4 
glucan chain and reattaching them by a-1, 6 bonds. Several other enzymes are likely 

25 to affect starch structure and composition, such as debranching enzymes, but initially 
most focus has been towards affecting the expression of starch synthases and starch 
branching enzymes. This has led to an extensive dissection of what enzymes are 
important for what features of starch synthesis. However it has never been convincingly 
shown how the synthesis of starch in plants whether amylose or amylopectin is initi- 

30 ated. 

Suggestions on the initiation of starch biosynthesis have been the subject of several 
scientific papers since it has been difficult to attribute a primer independent function to 
starch synthases under other than artificial in vitro conditions. By primer independent 

35 function implies the formation of new a-1 ,4 glucan chains with ADP-glucose as the sole 
starting point and building block. One proposed pathway has been that the presence of 
maltooligosaccharides act as primers for the addition of further glucose units by starch 
synthases although it has been debated on whether concentrations are sufficient to 
provide the basis for starch synthesis and also how these maltooligosaccharides would 

40 be formed in the plastids. 
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Starch is in plants synthesised as an energy storage molecule. Much is known about 
the enzymes participating in the starch biosynthesis although, the initiation of the starch 
molecule has remained unsolved. In mammalians and yeast an energy storage mole- 
5 cule very similar to starch is synthesised, glycogen. The enzymatic steps for synthesis 
of the respective molecules are analogous. In glycogen biosynthesis the initiation of the 
molecule is known and synthesised by the enzyme glycogenic Glycogenin is a self- 
glucosylating enzyme polymerising a linear chain of approximately 8 glucose molecules 
on itself. The primer of about 8 glucose residues is necessary for the enzymes catalys- 
10 ing the continuation of glucose incorporation to the glycogen molecule to function. 

Cheng et al., 1995, Mol. and Cell. Biol. 6632-6640 compare the two yeast proteins with 
rabbit muscle glycogenin. 

15 Roach et aL, 1997, Progress in Nucleic Acid Research and Molecular Biology Vol 57, 
describe self glycosylating initiator proteins and their roll in glycogen biosynthesis. 

Mu et al., 1997, Journal of Biological Chemistry 272 (44), 27589-27597 compare 
mammalian with yeast and C. elegans glycogenins. 

20 

Factors important for starch quantity have been investigated and many initiatives have 
been taken, especially in potato, to increase starch formation and content by over- 
expression or inhibition of various enzyme activities in areas of increased substrate 
supply, increased biosynthesis activity or shutting down substrate diverting pathways 
25 but so far this has led only to limited success with no commercial applications and only 
some scientific publications. 

Regierer, B. et al., Starch content and yield increase as a result of altering adenylate 
pools in transgenic plants. Nat Biotechnol. 20(1 2): 1256-60, (2002). 

30 

Sweetlove, LJ et al., Starch synthesis in transgenic potato tubers with increased 3- 
phosphoglyceric acid content as a consequence of increased 6-phosphof ructokinase 
activity. Planta 213(3):478-82 (2001). 

35 Veramendi, J et aL, Antisense repression of hexokinase 1 leads to an overaccumula- 
tion of starch in leaves of transgenic potato plants but not to significant changes in 
tuber carbohydrate metabolism. Plant Physiol. 1 21 (1):1 23-34 (1999). 

Geigenberger, P et al., Overexpression of pyrophosphatase leads to increased sucrose 
40 degradation and starch synthesis, increased activities of enzymes for sucrose-starch 
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interconversions, and increased levels of nucleotides in growing potato tubers. 
Planta. 205(3):428-37(1 998). 

Sweetlove, LJ et al M Starch metabolism in tubers of transgenic potato (Solanum tube- 
5 rosum) with increased ADPglucose pyrophosphorylase. Biochem J. 320 (2):493-8 
(1996). 

In other research a biochemical function superficially similar to the one initiating glyco- 
gen production in animals was investigated. A class of genes have then been isolated 

10 from several plants and was given the name amylogenin (WO94/04693; Sing, D. et al, 
P-Glucosylarginin: a new glucose-protein bond in a self-glucosylating protein from 
sweet corn, FEBS Letters 376:61-64, (1995) in the belief that it was the plant equiva- 
lent of glycogenin which acts as a self-glycosylating enzyme and provide primers for 
starch biosynthesis in plants. These genes have no resemblance from a structural point 

15 of view to the genes coding for glycogenin and have later been determined not to have 
a function in starch biosynthesis but rather might be of importance for cell wall forma- 
tion , see Bocca, S.N et al., Molecular cloning and characterization of the enzyme 
UDP-glucose: protein transglucosylase from potato. Plant Physiology and Biochemistry 
37(1 1):809-81 9(1 999). 

20 

WO 98/50553 describes nucleic acid fragments encoding a plant glycogenin or a water 
stress protein. WO 98/50553 also relates to the construction of chimeric genes encod- 
ing all or a portion of a plant glycogenin in sense or antisense orientation, wherein 
expression of the chimeric gene results in production of altered levels of a plant glyco- 
25 genin in a transformed host cell. 

Thus although many enzymes and pathways have been investigated in plants, the 
question on how starch formation is initiated and what determines the starch content is 
still unresolved. 

30 

Amylose is a commercially important starch product with many uses but unfortunately 
an increase in amylose content in transgenic potato plants is associated with a signifi- 
cant decrease in starch content, see figure 1. 

35 Analyses of transgenic high amylose potato lines show that there is an excess of solu- 
ble sugars in these lines, see figure 2. This indicates that the starch biosynthesis in 
these transgenic lines is not efficient enough for incorporation of available sugars. 

Amylose starch consists of very few reducing ends compared to native starch. There- 
40 fore it is commercially important to identify genes that further enhance the amylose 
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biosynthesis and that are capable to incorporate the excess of glucose residues avail- 
able and to compensate the decrease in starch content in plants that produces amy- 
lose in high amounts. 

5 The invention aims at enhancing the yield of amylose biosynthesis by the over- 
expression of genes which enhance starch biosynthesis in transformed plants. 

The invention describes genes coding for proteins which enhance starch production. 

10 The present invention describes the nucleic acids SEQ ID NO 1 and 3 from potato 
coding for enzymes enhancing the de novo starch biosynthesis. 

Example 1 describes that the nucleic acid sequences SEQ ID NO 1 or 3 can comple- 
ment a missing glycogenin function in yeast cells containing knock-out mutations for 
1 5 the self-g!ycosy!ating proteins Glg1 p and Glg2p. 

Gene constructs were made for gene-inhibition and over-expression of the two genes 
SEQ ID NO 1 or 3 in potato. Transgenic lines with the over-expressed or inhibited 
enzyme activity were analysed with regard to the genes influence on starch content. 

10 

Both genes were inserted in sense and antisense direction downstream of a plant 
promoter element, resulting in the transformation binary vectors pHSl , pHS2, pHS3 
and pHS4, see figures 3-7. 

*5 The antisense constructs were transformed into the potato plant varieties Prevalent 
and Producent and the sense constructs were transformed to the potato variety 
Desiree and the transgenic plant AM99-2003 according to the transformation method 
as described in example 2. The transgenic plant AM99-2003 was produced as de- 
scribed in example 3. 

0 

Prevalent and Producent are starch varieties having a starch content of approximately 
20 %. Desiree is a potato variety having a starch content of approximately 16% and 
AM99-2003 is a transgenic high amylose line having a starch and thereby amylose 
content of approximately 13%. 

5 

The putative genes were isolated from a tuber specific cDNA library of Solanum tube- 
rosum ( variety Prevalent). The library was made from a lambdaZAP directional kit 
(Stratagene). 
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Both cDNAs isolated were full-length clones of the individual genes and named StGH1 
and StGH2, for nucleic acid sequences see SEQ ID NO 1 and SEQ ID NO 3. 

pHS1 

5 

A 1300bp PCR fragment from the StGH1 gene was constructed in antisense direction 
driven by the gbss promoter. The PCR fragment was cut out from its cloning vector 
pCR4-TOPO (Invitrogen) with EcoRI (blunted) and Xbal. The fragment was ligated to 
the pGPTV-kan (Becker, D. et al M Plant Molecular Biology 20:1 195-1 197(1992 ) based 
1 0 binary vector pHo3.1 between a gbss promoter (WO 92/1 1 376) and a nos terminator at 
the Sail (blunted) and Xbal sites. The binary vector also includes nptll as selection 
marker driven by the nos promoter (Herrera, L et al., 1983). The construct was named 
pHS1 , for details see figure 3a and 4. 

15 pHS2 

A 2300bp full-length cDNA clone of StGH2 was cut out from the cloning vector 
pBluescript (Stratagene) with Xbal and Xhol. The gene was ligated in antisense direc- 
tion between the gbss promoter and nos terminator to the binary vector pHo3.1 at Xbal 
20 and Sail. As can be seen under pHS1 the vector has nptll as selection system. The 
vector was named pHS2, for details see figure 3b and 5. 

pHS3 

25 A full-length StGH1 cDNA, (1780bp) was cut out from the host vector pBluescript with 
EcoRI (blunted) and Bglll and ligated to the BamHI and Smal sites of pUCgbssprom 
(3886bp), containing pUC19 with the gbss promoter and the nos terminator. The plas- 
mid was named pUCGHI . 

30 A fragment with the gbss promoter, the StGH1 gene and the nos terminator was moved 
from pUCGHI with EcoRI (blunted) and Hindlll (2980bp) and ligated to Pstl (blunted) 
and Hindlll opened pSUN1 (WO 02/00900). The plasmid was named pSUNGHI . 

A 3600bp fragment containing the AHAS resistance gene from Arabidopsis thaliana 
35 (Sathasivan, K. et al., Plant Physiology 97(1991), 1044-1050) with nos promoter, see 
Herrera-Estrella, L et al., Nature 303:209-213(1983) and OCS terminator ( Wesley, 
S.V. et al., Plant J. 27(6):581 -590(2001) was ligated to pSUNGHI (9000bp) at the 
Smal site. The vector was given the name pHS3, for details see figure 3c and 6. 
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pHS4 

The gbss promoter and nos terminator was ligated to pBR322 with EcoRI and Hindlll. 
Between the promoter and terminator an EcoRI-Hincll full-length gene pStGH2 was 
5 cloned at the Xbal site. The 3366bp promotor-gene-terminator complex was cut using 
EcoRI (partial digestion) and EcoRV, and ligated to pSUN1 at EcoRI-EcoRV and 
named pSUNGH2. An Xbal fragment with AHAS gene (Arabidopsis thaliana), nos 
promoter and OCS terminator was ligated to pSUNGH2 opened with Xbal (partial 
digestion). The AHAS gene is used as selection marker. The construct was named 
1 0 pHS4, for details see figure 3d and 7. 

Example 2 describes the general method for the transformation of different potato plant 
varieties producing native starch or high amylose type starch with pHSt, pHS2, pHS3 
or pHS4. 

15 

The StGH1 and StGH2 genes were down-regulated in the potato plant varieties Preva- 
lent and Producent by transformation with the genes in antisense direction in relation 
to a plant regulatory element as described in example 4 and 6. Down-regulation of the 
two genes resulted in a decrease in gene expression in transgenic lines compared to 
20 their mother varieties in the order of 50-95%, see example 7 and table 3. Transgenic 
lines transformed with pHSl and pHS2 with confirmed decrease in gene-expression 
have a decrease in dry matter of 7 to 1 1% compared to their mother varieties, see 
example 8 and table 5. 

25 The StGH1 and StGH2 genes were over-expressed in potato driven by the tuber spe- 
cific promoter gbss, as described in example 5. A mutated AHAS gene was used as 
selection marker yielding tolerance to the Imazamox herbicides. Two potato varieties 
were transformed, Desiree and AM99-2003 a transgenic high amylose line with a 40% 
decrease in starch content compared to its parental line. The transformed lines over- 

30 expressing StGH1 and StGH2 were selected as described in example 6. The gene 
expression levels were analysed with real-time PCR, see example 7 and table 3. The 
over-expression of the genes StGH1 and StGH2 resulted in a 2 to 10 times increase in 
gene expression compared to their parental line. Furthermore the lines over-expressing 
StGH1 and StGH2 showed an increase in dry matter of up to 36 % as described in 

35 example 8 and table 5. 

The over-expression of StGH1 and StGH2 in transgenic potato plants producing amy- 
lose type starch resulted in an increased dry matter content, which means an increased 
amylose content as no amylopectin is produced, see examples 8 to 12. 

40 
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RNA interference (RNAi) functions by introduction of double stranded RNA (dsRNA) 
into a cell, which causes a degradation of the homologous RNA. The dsRNA is cleaved 
into small interfering RNA (siRNA) of 21-25 nucleotides by a ribonuclease called Dicer. 
The siRNA connects with a protein complex and forms a RNA-induced silencing com- 
5 plex RISC. The RISC becomes activated by ATP generated unwinding of the siRNA, 
which binds to the homologous transcript and cleaves the mRNA resulting in gene 
silencing, see Mc Manus MT and Sharp PA., Gene silencing in mammals by small 
interfering RNAs. Nature Rev Genet 3:737-747(2002); 

10 Dillin A., The specifics of small interfering RNA specificity. Proc Natl Acad Sci USA 
100(11):6289-6291 (2003); 

Tuschl T., Expanding small RNA interference. Nature Biotechnol 20:446-448 (2002) 

15 Production of high amylose lines was more efficient when using the RNAi constructs 
pHAS3 (figure 23) and pHAS8b (figure 20) than the antisense construct pHAbe12A. 
The frequency of high amylose lines of total transgenic shoots produced when using for 
example pHAS8b and pHAS3 is above >25%, compared to a frequency of approxi- 
mately 1% high amylose lines of total transgenic shoots produced , see example 15 

20 to 17. 

The RNAi constructs pHAS8b (figure 20) and pHAS3 (figure 23) (SEQ ID NO 24) used 
for high amylose potato production contain a be1 and be2 fragment (SEQ ID NO 19) 
cloned in inverted tandem. The constructs are only differing in the spacer used located 
25 between the inverted repeats where for pHAS8b a fragment of the be2 promoter was 
used (SEQ ID NO 18) while for pHAS3 a cloning residue from pBluescript was used 
(SEQ ID NO 23). The RNAi constructs resulted in efficient down-regulation of the 
branching enzyme genes. 

30 Furthermore, the fragments of respective be1 and be2 genes could be shorter or longer 
and could be targeting other parts of the branching enzyme genes. Shorter fragments 
for RNAi of be1 and be2 are described in SEQ ID NO 21 and 22. 

The starch biosynthesis enhancing protein according to the invention comprises the 
35 amino acid sequence SEQ ID NO 2 or 4 or a protein which comprises a sequence 
derived from SEQ ID NO 2 or 4, which is at least 50%, preferably at least 60%, more 
preferably at least 70%, more preferably at least 80%, still more preferably at least 
90%, most preferably at least 95%, identical at the amino acid level to the sequence 
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SEQ ID NO 2 or 4 and has the property of a starch biosynthesis enhancing protein. 
This starch biosynthesis enhancing protein may also be prepared by artificial variations 
starting from the SEQ ID NO 2 or 4, for example by substitution, insertion or deletion of 
amino acids. 

5 

Such a protein can be used to increase the production of amylose or amylopectin in 
non-transgenic or transgenic plants. 

The term "substitution" in the specification means the replacement of one or more 
10 amino acids by one or more amino acids. Preference is given to carrying out "conser- 
vative" replacements in which the amino acids replaced has a property similar to that of 
the original amino acid, for example replacement of Glu by Asp, Gin by Asn, Val by lie, 
Leu by He, Ser by Thr. 

1 5 "Deletion" is the replacement of an amino acid or amino acids by a direct bond. Pre- 
ferred positions for deletions are the polypeptide termini and the junctions between the 
individual protein domains. 

"Insertions" are insertions of amino acids into the polypeptide chain, with a direct bond 
20 formally being replaced by one or more amino acids. 

"Identity" between two proteins means the identity of the amino acids over the in each 
case entire length of the protein, in particular the identity which is calculated by com- 
parison with the aid of the Vector NTI Suite 7.1 Software of the company Informax 
25 (USA) using the Clustal W method (Thompson, JD et al., Nucleic Acid Research, 22 
(22):4673-4680, 1994) 

with the parameters set as follows: 

30 Multiple alignment parameter: 



Gap opening penalty 1 5 

Gap extension penalty 6.66 

Gap separation penalty range 8 

35 Gap separation penalty on 

% identity for alignment delay 40 

Residue specific gaps on 

Hydrophilic residue gap . off 

Transition weighing 0 



40 
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Pairwise alignment parameter: 



FAST algorithm 
K-tuple size 



off 
2 



5 



Gap penalty 
Window size 

Number of best diagonals 



5 
4 
4 



10 Accordingly, a protein which is at least 50% identical at the amino acid level to the 
sequence SEQ ID NO 2 or 4 means a protein which, when comparing its sequence 
with the sequence SEQ ID NO 2 or 4, is at least 50% identical, in particular according 
to the above program algorithm using the above set of parameters. 

1 5 Further natural examples of genes coding for a starch biosynthesis enhancing protein 
according to the invention can readily be found, for example, in various organisms, in 
particular in plants, whose genomic sequence is known by comparing the identity of the 
amino acid sequences or of the corresponding back-translated nucleic acid sequences 
from databases with the sequence of SEQ ID NO 2 or 4, in particular according to the 

20 above program algorithm using the above set of parameters. 

In the completed genome sequence of Arabidopsis thaliana, five putitative coding 
sequences can be deduced by searching for exon/intron boundaries and comparing 
with back translated sequences of SEQ ID NO 2 or 4. 

25 

The following nucleic acid sequences of Arabidopsis thaliana SEQ ID NO 5, SEQ ID 
NO 7, SEQ ID NO 9, SEQ ID NO 1 1 and SEQ ID NO 13 could be used to carry out the 
invention and are coding for the starch biosynthesis enhancing proteins SEQ ID NO 6, 
SEQ ID NO 8, SEQ ID NO 10, SEQ ID NO 12 and SEQ ID NO 14. 

Furthermore the following nucleic acid sequences or ESTs can be used in order to 
identify and clone genes coding for a starch biosynthesis enhancing protein from plant 
organisms: 

Tomato ESTs from GenBank: AW216407, BE450055, BF097262, BE450557, 
BF097173 

Wheat ESTs from GenBank: BJ292476, BJ278875, BJ283925, BE442966, CA666180, 
BQ483228 
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Maize EST from GenBank: BG319971 

Rice ESTs from GenBank: AL606633, CA752890, BI813265 

5 Natural examples of starch biosynthesis enhancing proteins and the corresponding 
genes can furthermore readily be found in various organisms, in particular plants, 
whose genomic sequence is unknown by hybridization techniques in a manner known 
per se, for example starting from the nucleic acid sequences SEQ ID NO 1 or SEQ ID 
NO 3 or any of the SEQ ID NO 5, 7, 9, 1 1 or 13 or any of the EST sequences de- 
10 scribed above. 

The hybridization may be carried out under moderate (low stringency) or, preferably, 
under stringent (high stringency) conditions. 

15 Such hybridization conditions are described, inter alia, in Sambrook, J., Fritsch, E.F., 
Maniatis, T., in: Molecular Cloning (A Laboratory Manual), 2 nd edition, Cold Spring 
Harbor Laboratory Press, 1989, pages 9.31-9.57 or in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 

20 By way of example, the conditions during the washing step may be selected from the , 
range of conditions which is limited by those with low stringency (with 2X SSC at 50°C) 
and those with high stringency (with 0.2X SSC at 50°C, preferably at 65°C) (20X SSC: 
0.3 M sodium citrate, 3 M sodium chloride, pH 7.0). 

25 In addition, the temperature may be raised during the washing step from moderate 
conditions at room temperature, 22°C, to stringent conditions at 65°C. 

Both parameters, salt concentration and temperature, may be varied simultaneously 
and it is also possible to keep one of the two parameters constant and to vary only the 
30 other one. It is also possible to use denaturing agents such as, for example, formamide 
or SDS during hybridization. In the presence of 50% formamide, the hybridization is 
preferably carried out at 42°C. 

Some exemplary conditions for hybridization and washing step are listed below: 

35 

(1 ) hybridization conditions with, for example 
(i) 4X SSC at 65°C, or 



40 



(ii) 6X SSC at 45°C, or 
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(iii) 6X SSC at 68°C, 1 00 mg/ml denatured fish sperm DNA, or 

(iv) 6X SSC, 0.5% SDS, 1 00 mg/ml denatured fragmented salmon sperm DNA 
5 at 68°C, or 

(v) 6X SSC, 0.5% SDS, 100 mg/ml denatured fragmented salmon sperm 
DNA, 50% formamide at 42°C, or 

1 0 (vi) 50% formamide, 4X SSC at 42°C, or 

(vii) 50% (vol/vol) formamide, 0.1 % bovine serum albumin, 0.1 % Ficoll, 0.1 % 
polyvinylpyrrolidone, 50 mM sodium phosphate buffer pH 6.5, 750 mM 
NaCI, 75 mM sodium citrate at 42°C, or 

15 

(viii) 2X or 4X SSC at 50°C (moderate conditions), or 

(ix) 30 to 40% formamide, 2X or 4X SSC at 42°C (moderate conditions). 
20 (2) Washing steps of 1 0 minutes each with, for example 

(i) 0.01 5 M NaCI/0.001 5 M sodium citrate/0. 1 % SDS at 50°C, or 

(ii) 0.1XSSCat65°C, or 

25 

(iii) 0.1 X SSC, 0.5% SDS at 68°C, or 

(iv) 0. 1 X SSC, 0.5% SDS, 50% formamide at 42°C, or 
30 (v) 0.2X SSC, 0. 1 % SDS at 42°C, or 

(vi) 2X SSC at 65°C (moderate conditions). 

Preferred proteins with starch biosynthesis enhancing activity are proteins from plants, 
35 cyanobacteria, mosses or algae, particular preferred from plants. A particular preferred 
protein comprises the amino acid sequence SEQ ID NO 2 or 4. 



40 



If, for example, the protein is to be expressed in a plant, it is frequently advantageous 
to use the codon usage of said plant for backtranslation and resynthesis of the gene 
according to codon usage of said plant. 
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The invention further relates to nucleic acids encoding a starch biosynthesis enhancing 
protein according to the invention. All of the nucleic acids mentioned in the specification 
may be, for example, a RNA sequence, DNA sequence or cDNA sequence. 

5 

Suitable nucleic acid sequences can be obtained, for example, by back-translating the 
polypeptide sequence according to the genetic code. For this, preference is given to 
using those codons which are used frequently according to the organism-specific 
codon usage. The codon usage can be readily determined on the basis of computer 
10 analyses of other known genes of the organisms in question. 



All of the above-mentioned genes coding for a starch biosynthesis enhancing protein 
can furthermore be prepared in a manner known per se from the nucleotide building 
blocks by chemical synthesis, for example by fragment condensation of individual 

15 overlapping complementary nucleic acid building blocks of the double helix. The 
chemical synthesis of oligonucleotides may be carried out, for example, in a known 
manner according to the phosphoramidite method (Voet, Voet, 2nd edition, Wiley Press 
New York, pages 896-897). The annealing of synthetic oligonucleotides and filling-in 
of gaps with the aid of the Klenow fragment of DNA polymerase and ligation reactions 

20 and also general cloning methods are described in Sambrook et al. (1989), Molecular 
cloning: A laboratory manual, Cold Spring Harbor Laboratory Press. 



Genes coding for this function may be integrated in the plant chromosomes and upon 
expression utilize a transit peptide to localise to plastids which is the organelle where 

25 starch/amylose biosynthesis takes place or be integrated directly into the plastid 

genome and thereby suipass the need for the localisation signal. The genes may be 
expressed constitutively or organ specific. For organ specific expression, promoters 
with tuber specific expression is preferable in potatoes while in cereals as maize or 
wheat a endosperm specific expression would be preferred to achieve a high degree 

30 of expression in organs where storage starch is accumulated. When transformed to 
the plastid genome then specific regulatory elements suitable for that organelle apply. 

The genes of this invention may be used in combination with other genes that can be 
situated on the same gene construct or transferred and combined by co-transformation 
35 or super transformation. Genes and traits that are of interest to combine with the genes 
of the instant invention are agronomic or input trait such as herbicide tolerance, dis- 
ease and pest resistance or stress tolerance but could also be output traits such as 
starch structure modification or yield. Genes and traits used in combination with the 
genes described in the invention could be for adding a function that is not present in 
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the modified plant species or over-expressing a function that is already present or 
inhibiting a function by the use of antisense, RNAi or antibodies. 

The invention may be used to increase the starch or amylose content in potato tubers 
5 but would in its context not be limited to potatoes but would be applicable to other 
starch producing and storing plants such as e.g. corn, cassava, wheat, barley, oat and 
rice. 

The described invention is particularly suited for eliminating a lower starch content 
10 associated with increased amylose content in different plants where the number of a- 
1 ,4-glucan chain non-reducing ends is greatly reduced due to the reduction or elimina- 
tion of a-1 ,6 branch formation. Amyiopectin is an extremely efficient structure, as is 
glycogen, for polysaccharide production since it is very branched and thus contains as 
many points accessible for starch synthesis as there are non-reducing ends. Starch 
15 that is mainly composed of amylose, contains much fewer branches and thus the bio- 
synthetic capacity is reduced. In order to enhance starch biosynthesis when there is no 
amyiopectin production, expression of genes as described in the present invention, 
could for example form new primers that can replace amyiopectin as a source for 
starch biosynthesis capacity and thereby reduce or eliminate the lost capacity for 
20 starch synthesis. To further illustrate the situation the degree of branching in ordinary 
potato starch is approximately 3.1% while in high amylose starch it is 0.3-1 .0% depend- 
ing on amylose content. This decrease of branching and starch content is further asso- 
ciated with an increase in glucose and fructose content. 

25 The increased amylose content and thereby solids content is also advantageous for the 
processing properties in various applications such as for french fries, potato crisps and 
other potato based products. In addition to an increased solid content, the inserted 
genes SEQ ID NO 1 or 3 of the present invention result in the transformation of excess 
sugars into a-1 ,4-glucan chains and thereby reducing browning of fried potato prod- 

30 ucts, Maillard reaction, in which amino acids react with free sugars. 
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Furthermore 

(i) any gene of plant origin with the described activity can be used for increasing 
amylose content and solids 

(ii) the genes can be controlled by any regulating promoter element functional in 
5 plant. 

(iii) any starch producing crop of any variety can be transformed with the described 
genes. 

(iiii) any plant transformation method can be used. 

(Hiii) any binary vector can be used for the insertion of the described genes. 
1 0 (iiiiii) the described genes can be combined with any other desired transgenically 
inserted traits. 



The invention further relates to a method for producing amylose by culturing plants 
which have, compared to a wild type or a genetically modified plant producing already 
15 amyiose type starch, an increased amylose biosynthesis activity, said proteins compris- 
ing the amino acid sequence SEQ ID NO 2 or 4 or a sequence which is derived from 
one of these sequences by substitution, insertion or deletion of amino acids and which 
is at least 50% identical at the amino acid level to the sequence SEQ ID NO 2 or 4. 

20 Increased amylose biosynthesis activity compared to the wild type or transgenic line 
means that the amount of amylose formed is increased by the starch biosynthesis 
enhancing protein in comparison with the wild type or transgenic line. 

This increase in starch or amylose biosynthesis activity is preferably at least 5%, fur- 
25 ther preferably at least 10%, further preferably at least 20%, further preferably at least 
50%, more preferably at least 100%, still more preferably at least 200%, in particular at 
least 500%, of the protein activity of the wild type or transgenic line. 



A "wild type" means the corresponding genetically unmodified starting plant. This plant 
30 is preferably Solanum tuberosum. 



Depending on the context, the term "plant" means a wild type starting plant or a geneti- 
cally modified starting plant. 



35 "Transgenic plant" or "genetically modified plant" means that the plant contains an 

additional inserted gene segment that may be foreign or endogenous to the plant spe- 
cies, additional genes or additional gene fragments in sense and/or antisense orienta- 
tion to a suitable promoter corresponding to the following polypeptides and showing 
enzymatic activity of a starch branching enzyme I, a starch branching enzyme II and/or 
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the starch biosynthesis enhancing protein as specified in SEQ ID NO 1 or 3 or poly- 
nucleotides having at least 60 % sequence identity thereof. 

"Amylose type starch" means that the amyiose content of the starch is increased com- 
5 pared to the amylose content of starch produced by wild type plants especially wild 
type potato plants. 

The starch or amylose biosynthesis activity may be increased in various ways, for 
example by eliminating inhibiting regulatory mechanisms at the translation and protein 
10 levels or by increasing the gene expression of a nucleic acid encoding a starch bio- 
synthesis enhancing protein compared to the wild type or transgenic plant, for example 
by Inducing a gene encoding the starch biosynthesis enhancing protein via activators 
or by introducing into the plant nucleic acids encoding a starch biosynthesis enhancing 
protein. 

15 

According to the invention, increasing the gene expression of a nucleic acid encoding a 
starch biosynthesis enhancing protein could also mean manipulating the expression of 
the endogenous starch biosynthesis enhancing protein intrinsic to the plant, in particu- 
lar in potato plants. This may be achieved, for example, by modifying the promoter 
20 DNA sequence of genes encoding a starch biosynthesis enhancing protein. Such a 
modification which leads to a modified or preferably increased rate of expression of at 
least one endogenous gene encoding a starch biosynthesis enhancing protein may be 
carried out by deleting or inserting DNA sequences. 

25 It is also possible to modify expression of one or more endogenous starch biosynthesis 
enhancing protein by applying exogenous stimuli. This may be carried out by particular 
physiological conditions, i.e. by applying foreign substances. 

Furthermore, it is possible to achieve a modified or increased expression of at least 
30 one endogenous gene encoding a starch biosynthesis enhancing protein by the inter- 
action of a regulatory protein which is modified or is not present in the untransformed 
plant. 

In a preferred embodiment, the starch biosynthesis enhancing protein activity is in- 
35 creased compared to the wild type or transgenic plant by increasing the gene expres- 
sion of a nucleic acid encoding a starch biosynthesis enhancing protein, said starch 
biosynthesis enhancing protein comprising the amino acid sequence SEQ ID NO 2 or 
4 or a sequence which is derived from said sequences by substitution, insertion or 
deletion of amino acids and which is at least 50% identical at the amino acid level to 
40 the sequence SEQ ID NO 2 or 4. 
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In the case of genomic nucleic acid sequences encoding a starch biosynthesis enhanc- 
ing protein from eukaryotic sources, which contain introns, preferably already proc- 
essed nucleic acid sequences such as the corresponding cDNAs are to be used, if the 
5 host organism is unable to or cannot be enabled to express the corresponding starch 
biosynthesis enhancing protein. 

In this preferred embodiment, the transgenic plant of the invention thus contains, com- 
pared to the wild type or transgenic plant, at least one further gene encoding a starch 
10 biosynthesis enhancing protein. In this preferred embodiment, the genetically modified 
plant of the invention has accordingly at least one transgenic endogenous or exoge- 
nous nucleic acid encoding a starch biosynthesis enhancing protein. 

Suitable and preferred nucleic acids are described above. In a particularly preferred 
15 embodiment, a nucleic acid comprising the sequence SEQ ID NO 1 or 3 is introduced 
into the plant. 

According to the invention, organisms means preferably eukaryotic organisms, such 
as, for example, yeasts, algae, mosses, fungi or plants, which are capable of producing 
20 starch or amylose, either as wild type or enabled by genetic modification. Preferred 
organisms are photosynthetically active organisms such as, for example, plants which, 
even as a wild type, are capable of producing starch or amylose type starch. 

Particularly preferred organisms are potato plants. 

25 

The present invention furthermore relates to the use of proteins comprising the amino 
acid sequence SEQ ID NO 2 or 4 or a sequence which is derived from this sequence 
by substitution, insertion or deletion of amino acids and which is at least 50% identical 
at the amino acid level to the sequence SEQ ID NO 2 or 4 and having starch biosyn- 
30 thesis enhancing activity. 

The present invention further relates to the use of nucleic acids SEQ ID NO 1 or 3 or 
one of the SEQ ID NOs 5, 7, 9, 11 or 13 encoding proteins having a starch biosynthe- 
sis enhancing activity in plants. 

35 

The transgenic organisms, in particular plants, are preferably prepared by transforming 
the starting organisms, in particular plants, with a nucleic acid construct containing the 
above-described nucleic acid, encoding a starch biosynthesis enhancing protein which 
is functionally linked to one or more regulatory signals ensuring transcription and trans- 
40 lation in said organisms. 
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These nucleic acid constructs in which the coding nucleic acid sequence is functionally 
linked to one or more regulatory signals ensuring transcription and translation in organ- 
isms, in particular in plants, are also referred to as expression cassettes herein below. 

5 

Accordingly, the invention further relates to nucleic acid constructs, in particular to 
nucleic acid constructs functioning as expression cassette, which comprise a nucleic 
acid encoding a starch biosynthesis enhancing protein which is functionally linked to 
one or more regulatory signals ensuring transcription and translation in organisms, in 
1 0 particular in plants. 

The regulatory signals preferably comprise one or more promoters ensuring transcrip- 
tion and translation in organisms, in particular in plants. 

15 The expression cassettes include regulatory signals, i.e. regulatory nucleic acid se- 
quences, which control expression of the coding sequence in the host cell. According 
to a preferred embodiment, an expression cassette comprises upstream, i.e. at the 5' 
end of the coding sequence, a promoter and downstream, i.e. at the 3' end, a polyade- 
nylation signal and, where appropriate, further regulatory elements which are opera- 

20 tively linked to the coding sequence for at least one of the above-described genes 

located in between. Operative linkage means the sequential arrangement of promoter, 
coding sequence, terminator and, where appropriate, further regulatory elements in 
such a way that each of the regulatory elements can properly carry out its function in 
the expression of the coding sequence. 

25 

When the organism used is a plant, the nucleic acid constructs and expression cas- 
settes of the invention preferably contain a nucleic acid encoding a plastid transit pep- 
tide ensuring localisation in plastids. 

30 The preferred nucleic acid constructs, expression cassettes and vectors for plants and 
methods for preparing transgenic plants and also the transgenic plants themselves are 
described in examples 2 to 6 below. 

The sequences preferred for operative linkage, but not limited thereto, are targeting 
35 sequences for ensuring subcellular localisation to plastids such as amyloplasts or 

chloroplasts but could also mean in the apoplasts, in the vacuole, in the mitochondrion, 
in the endoplasmic reticulum (ER), in the nucleus, in elaioplasts or in other compart- 
ments and translation enhancers such as the tobacco mosaic virus 5'-leader sequence 
(Gallie et aL, Nucl. Acids Res. 15 (1987), 8693-8711). 
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A suitable promoter of the expression cassette is in principle any promoter which is 
able to control the expression of foreign genes in plants. 

"Constitutive" promoter means those promoters which ensure expression in numerous, 
5 preferably all, tissues over a relatively long period of plant development, preferably 
during the entire plant development. 

Preference is given to using, in particular, a promoter from plants or a promoter origi- 
nating from a plant virus. Preference is in particular given to the promoter of the 35S 
10 transcript of the CaMV cauliflower mosaic virus (Franck et al. (1980) Cell 21 :285-294; 
Odell et al. (1985) Nature 313:810-812; Shewmaker et al. (1985) Virology 140:281-288; 
Gardner et al. (1986) Plant Mol Biol 6:221-228) or the 19S CaMV promoter 
(US 5,352,605; WO 84/02913; Benfey et al. (1989) EMBO J 8:2195-2202). 

15 Another suitable constitutive promoter is the Rubisco small subunit (SSU) promoter 
(US 4,962,028), the leguminB promoter (GenBank Acc. No. X03677), the Agrobacte- 
rium nopaline synthase promoter, the TR double promoter, the agrobacterium OCS 
(octopine synthase) promoter, the ubiquitin promoter (Holtorf S et al. (1995) Plant Mol 
Biol 29:637-649), the ubiquitin 1 promoter (Christensen et al. (1992) Plant Mol Biol 

20 18:675-689; Bruce et al. (1989) Proc Natl Acad Sci USA 86:9692-9696), the Smas 
promoter, the cinnamyl alcohol dehydrogenase promoter (US 5,683,439), the promot- 
ers of the vacuolar ATPase subunits or the promoter of a proline-rich wheat protein 
(WO 91/13991), the Pnit promoter (Y07648.L, Hillebrand et al. (1998), Plant. Mol. Biol. 
36, 89-99, Hillebrand etal. (1996), Gene, 170, 197-200) and other promoters of genes 

25 whose constitutive expression in plants is known to the skilled worker. 

The expression cassettes may also contain a chemically inducible promoter (review: 
Gatz et al. (1997) Annu Rev Plant Physiol Plant Mol Biol 48:89-108) which may be 
used to control expression of the starch biosynthesis enhancing protein gene in the 

30 plants at a particular time. Promoters of this kind, such as, for example, the PRP1 
promoter (Ward et al. (1993) Plant Mol Biol 22:361-366), salicylic acid-inducible pro- 
moter (WO 95/19443), a benzenesuifonamide-inducible promoter (EP 0 388 186), a 
tetracycline-inducible promoter (Gatz et al. (1992) Plant J 2:397-404), an abscisic acid- 
inducible promoter (EP 0 335 528) and an ethanol- or cyclohexanone-inducible pro- 

35 moter (WO 93/21334), may likewise be used. 

Further examples of suitable promoters are fruit ripening-specific promoters such as, 
for example, the fruit ripening-specific promoter from tomato (WO 94/21 794, EP 409 
625). Development-dependent promoters partly include the tissue-specific promoters, 
40 since individual tissues are naturally formed in a development-dependent manner. 
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Furthermore, preference is given in particular to those promoters which ensure expres- 
sion in tissues or parts of the plant, in which, for example, biosynthesis of starch or 
amylose or of the precursors thereof takes place. Preference is given, for example, to 
5 promoters with specificities for leaves, stems, roots, seeds and tubers. 

Seed-specific promoters are, for example, the phaseoline promoter (US 5,504,200; 
Bustos MM et al. (1989) Plant Cell 1(9):839-53), the promoter of the 2S albumin gene 
(Joseffson LG etal. (1987) J Biol Chem 262:12196-12201), the legumin promoter 

1 0 (Shirsat A et al. (1 989) Mol Gen Genet 21 5(2): 326-331 ), the USP (unknown seed 
protein) promoter (Baumlein H et al. (1991) Mol Gen Genet 225(3):459-67), the pro- 
moter of the napin gene (US 5,608,152; Stalberg K et al. (1996) L Planta 199:515-519), 
the sucrose-binding protein promoter (WO 00/26388) and the legumin B4 promoter 
(LeB4; Baumlein H et al. (1991) Mol Gen Genet 225: 121-128; Baeumlein et al. (1992) 

15 Plant Journal 2(2):233-9; Fiedler U et al. (1995) Biotechnology (NY) 13(10):1090f), the 
Arabidopsis oleosin promoter (WO 98/45461), the Brassica Bce4 promoter (WO 
91/13980) and the vicillin promoter (Weschke et al. 1988, Biochem. Physiol. Pflanzen 
183, 233-242; Baumlein H et al. (1991) Mol Gen Genet 225(3) :459-67). 

20 Further suitable seed-specific promoters are those of the genes coding for high mole- 
cular weight glutenine (HMWG), gliadin, branching enzyme, ADP glucose pyrophos- 
phatase (AGPase) and starch synthase. Preference is further given to promoters which 
allow seed-specific expression in monocotyledons such as e.g. corn, barley, wheat, 
rye, rice, etc. It is also possible to use advantageously the promoter of the Ipt2 or Iptl 

25 gene (WO 95/15389, WO 95/23230) or the promotors described in WO 99/1 6890 

(promoters of the hordein gene, the glutelin gene, the oryzin gene, the prolamin gene, 
the gliadin gene, the glutelin gene, the zein gene, the kasirin gene and the secalin 
gene). 

30 Examples of tuber-, storage root- or root-specific promoters are the patatin promoter 
class I (B33), the potato cathepsin D inhibitor promoter and the potato granular bound 
starch synthase (GBSS) promoter as described in EP-A 0 921 191. 

Examples of leaf-specific promoters are the cytosolic FBPase promoter from potato 
35 (WO 97/05900), the rubisco (ribulose-1 ,5-bisphosphate carboxylate) SSU (small sub- 
unit) promoter and the potato ST-LSI promoter (Stockhaus et al. (1989) EMBO J 
8:2445-2451). 
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Further promoters suitable for expression in plants have been described (Rogers et al. 
(1987) Meth in Enzymol 153:253-277; Schardl etal. (1987) Gene 61:1-11; Berger et al. 
(1989) Proc Natl Acad Sci USA 86:8402-8406). 

5 The site of starch and amylose biosynthesis in potato plants is the amylopiast. There- 
fore amyloplast-specific targeting and activity of the gene products of the inventive 
nucleic acids SEQ ID NO 1 or 3 encoding a starch biosynthesis enhancing protein is 
desirable. 

10 The expression may also take place in a tissue-specific manner in all parts of the plant. 

A further preferred embodiment therefore relates to a tuber-specific expression of the 
nucleic acids SEQ ID NO 1 or 3. 

15 In addition, a constitutive expression of the gene encoding a starch biosynthesis en- 
hancing protein is advantageous. On the other hand, however, an inducible expression 
of this gene may also be desirable. 

An expression cassette is preferably prepared by fusing a suitable promoter to an 
20 above-described nucleic acid encoding a starch biosynthesis enhancing protein and/ 
preferably, to a nucleic acid which has been inserted between promoter and nucleic 
acid sequence and which codes for an amyloplast-specific transit peptide and also to a 
polyadenylation signal according to familiar recombination and cloning techniques as 
described, for example, in T. Maniatis, E.F. Fritsch and J. Sambrook, Molecular Clon- 
25 ing: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1989) and in T.J. Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984) and in 
Ausubel, F.M. et al., Current Protocols in Molecular Biology, Greene Publishing Assoc. 
and Wiley-lnterscience (1987). 

30 

Particular preference is given to inserted nucleic acid sequences which ensure target- 
ing in the amyloplasts. 

It is also possible to use an expression cassette in which the nucleic acid sequence 
35 encodes a starch biosynthesis enhancing protein fusion protein, one part of the fusion 
protein being a transit peptide which controls translocation of the polypeptide. Prefer- 
ence is given to amyloplast-specific transit peptides which, after translocation of starch 
biosynthesis enhancing protein into the amyloplasts, are enzymatically cleaved off the 
starch biosynthesis enhancing protein part. 
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Particular preference is given to the transit peptide which is derived from the Nicotiana 
tabacum plastid transketolase or from another transit peptide (e.g. the transit peptide of 
the rubisco small subunit or of ferredoxin NADP oxidoreductase and also of isopentenyl 
pyrophosphate isomerase-2) or from its functional equivalent. 

5 

Further examples of a plastid transit peptide are the transit peptide of the plastid 
isopentenyl pyrophosphate isomerase-2 (IPP-2) from Arabidopsis thaliana and the 
transit peptide of the ribulose bisphosphate carboxylase small subunit (rbcS) from pea 
(Guerineau, F, Woolston, S, Brooks, L, Mullineaux, P (1988) An expression cassette 
10 for targeting foreign proteins into the chloroplasts. Nucl. Acids Res. 16: 1 1380). 

Plant genes of the invention which encode a plant starch biosynthesis enhancing pro- 
tein may already contain the nucleic acid sequence which encodes a plastid transit 
peptide. In this case, a further transit peptide is not required. For example, the Solanum 
15 tuberosum sequences of the starch biosynthesis enhancing protein of the invention 
SEQ ID NO 1 or 3 contain already a transit peptide sequence. 

The nucleic acids of the invention may be prepared synthetically or obtained naturally 
or comprise a mixture of synthetic and natural nucleic acid components and may also 
20 be composed of various heterologous gene sections of various organisms. 

As described above, preference is given to synthetic nucleotide sequences with codons 
which are preferred by plants. These codons which are preferred by plants may be 
determined from codons which have the highest frequency in proteins and which are 
25 expressed in most of the interesting plant species. 

When preparing an expression cassette, it is possible to manipulate various DNA 
fragments in order to obtain a nucleotide sequence which expediently can be read in 
the correct direction and is provided with a correct reading frame. The DNA fragments 
30 may be linked to one another by attaching adaptors or linkers to said fragments. 

It is furthermore possible to use manipulations which provide appropriate restriction 
cleavage sites or which remove excess DNA or restriction cleavage sites. In those 
cases for which insertions, deletions or substitutions such as, for example, transitions 
35 and transversions are suitable, in vitro mutagenesis, primer repair, restriction or ligation 
can be used. 

Preferred polyadenylation signals are polyadenylation signals functional in plants, 
exemplified by those which correspond essentially to T-DNA polyadenylation signals 
40 from Agrobacterium tumefaciens, in particular of the T-DNA gene 3 (octopine synthase) 
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or OCS terminator, the complete sequence of the Ti plasmid pTiACHS (Gielen et ah, 
EMBO J. 3, 835 -846(1984) or functional equivalents. 



The invention further relates to the use of the nucleic acids SEQ ID NO 1 or 3 for in- 
5 creasing the starch or amylose content in plants, e.g. potato plants which, as wild type, 
are capable of producing starch or amylose, see examples 2 to 13. 

The invention Is not limited to the over-expression of the nucleic acid sequences 
SEQ ID NO 1 or SEQ ID NO 3 in plants especially potato plants. 

10 

The over-expression of both nucleic acid sequences SEQ ID NO 1 and 3 in a plant can 
be used for enhancing amylose biosynthesis, see examples 14-16. Constructs contain- 
ing the nucleic acids SEQ ID NO 1 and SEQ ID NO 3 can also be used for increasing 
the starch content or the amylopectin content in plants. These constructs can be made 
15 on the same T-DNA driven by one promoter each. These constructs can also be made 
on the same T-DNA in tandem driven by the same promoter. These constructs can 
also be transformed using more than one construct, either at the same time (co-trans- 
formation) or in different transformation events. 

20 The above-described proteins and nucleic acids may be used for producing starch or 
amylose in transgenic plants. 

The transfer of foreign genes into the genome of an organism, in particular of a plant, 
is referred to as transformation. 

25 

For this purpose, methods known per se for transforming plants and regenerating 
plants from plant tissues or plant cells can be used, in particular in plants, for transient 
or stable transformation, e.g. as described in example 2. 

30 Suitable methods for the transformation of plants are the protoplast transformation by 
polyethylene glycol-induced DNA uptake, the biolistic method using the gene gun - also 
known as particle bombardment method, electroporation, the incubation of dry embryos 
in a DNA-containing solution, microinjection and the above-described Agrobacterium- 
mediated gene transfer. Said methods are described, for example, in B. Jenes et aL, 

35 Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1 , Engineering and Utili- 
zation, edited by S.D. Kung and R. Wu, Academic Press (1993), 128-143 and in 
Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 205-225). 
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The construct to be expressed is preferably cloned into a vector which is suitable for 
transforming Agrobacterium tumefaciens, for example pBin19 (Bevan et al., Nucl. Acids 
Res. 12 (1984), 8711) or preferably pSUN2 (WO 02/00900). 

5 Accordingly, the invention furthermore relates to vectors containing the above- 
described nucleic acids, nucleic acid constructs or expression cassettes. 

Agrobacteria which have been transformed with an expression cassette can be used in 
a known manner for the transformation of plants, for example by bathing injured leaves 
10 or leaf sections in an Agrobacterium solution and then culturing them in suitable media. 

Apart from in plants, the expression cassette may also be used for transforming bacte- 
ria, in particular cyanobacteria, mosses, yeasts, filamentous fungi and algae. 

1 5 Genetically modified plants, also referred to as transgenic plants herein below, are 

preferably prepared by cloning the fused expression cassette which expresses a starch 
biosynthesis enhancing protein into a vector, for example pBin19, which is suitable for 
transforming Agrobacterium tumefaciens. 

20 Agrobacteria which have been transformed with such a vector may then be used in a 
known manner for the transformation of plants, in particular of crop plants, for example 
by bathing injured leaves or leaf sections in an Agrobacterium solution and then cultur- 
ing them in suitable media. 

25 The transformation of plants by Agrobacteria is described, inter alia, in F.F. White, 
Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, Vol. 1 , Engineering 
and Utilization, edited by S.D. Kung and R. Wu, Academic Press, 1993, pp. 15-38. 
Transgenic plants which contain a gene for expression of a nucleic acid encoding a 
starch biosynthesis enhancing protein, which has been integrated into the expression 

30 cassette, can be regenerated in a known manner from the transformed cells of the 
injured leaves or leaf sections. 

A host plant is transformed with a nucleic acid SEQ ID NO 1 or 3 encoding a starch 
biosynthesis enhancing protein by incorporating an expression cassette as insertion 
35 into a recombinant vector whose vector DNA comprises additional functional regulatory 
signals, for example sequences for replication or integration. Suitable vectors are 
described inter alia, in Methods in Plant Molecular Biology and Biotechnology (CRC 
Press), chapter 6/7, pp. 71-119 (1993). 
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By way of example, the plant expression cassette may be Incorporated into a derivative 
of the transformation vector pBin-19 with 35s promoter (Bevan, M., Nucleic Acids Re- 
search 12: 8711-8721 (1984). 

5 Using the above-cited recombination and cloning techniques, it is possible to clone the 
expression cassettes into suitable vectors for maintenance and propagation of genetic 
material for example in E. coli. Suitable cloning vectors are, inter alia, pBR322, pUC 
series, M13mp series, pBluescript and pACYC184. Particularly suitable are binary 
vectors which can replicate both in E coli and in agrobacteria. 

10 

The invention therefore further relates to the use of the above-described nucleic acids 
or of the above-described nucleic acid constructs, in particular of the expression cas- 
settes, for preparing genetically modified plants or for transforming plants, plant cells, 
plant tissues or parts of plants. 

15 

The use is preferably aimed at increasing the starch or amylose content of the plant, of 
the tubers or in other parts of the plant. 

The use is most preferably aimed at increasing the starch or amylose content of wild- 
20 type or transgenic potato plants and especially the tubers of wild-type or transgenic 
potato plants. 

Accordingly, the invention further relates to a method for preparing genetically modified 
plants by introducing an above-described nucleic acid or an above-described nucleic 
25 acid construct into the genome of the starting organism. 

The invention further relates to the genetically modified organisms, the genetic modifi- 
cation increasing the activity of a starch biosynthesis enhancing protein compared to a 
wild type or transgenic plant and the starch biosynthesis enhancing protein comprising 
30 the amino acid sequence SEQ ID NO 2 or 4 or a sequence which is derived from this 
sequence by substitution, insertion or deletion of amino acids and which is at least 50% 
identical at the amino acid level to the sequence SEQ ID NO 2 or 4. 

As illustrated above, the starch biosynthesis enhancing protein activity is increased 
35 compared to the wild type or transgenic plant preferably by increasing the gene ex- 
pression of a nucleic acid encoding a starch biosynthesis enhancing protein. 

In a further preferred embodiment, gene expression of a nucleic acid encoding a starch 
biosynthesis enhancing protein is increased, as illustrated above, by introducing nu- 
40 cleic acids encoding a starch biosynthesis enhancing protein into the organism and 
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thus by over-expressing nucleic acids encoding a starch biosynthesis enhancing pro- 
tein. 



Such transgenic plants, their propagation material and their plant cells, plant tissues, 
5 plant parts or tubers are a further subject of the present invention. 

Genetically modified plants of the invention, which have an increased starch or amy- 
lose content and which can be consumed by humans and animals, can also be used as 
food- or feedstuffs or as feed and food supplements, for example directly or after proc- 
10 essing known per se. The genetically modified plants may furthermore be used for 

producing starch or amylose-containing extracts of said plant and/or for producing feed 
and food supplements. 

The invention further relates to: 

15 

I. A polynucleotide that encodes a polypeptide of SEQ ID NO 1 or 3. 

II. A polynucleotide comprising at least 30 contiguous bases of SEQ ID NO 1 or 3. 

20 III. A polynucleotide having at least 60 % sequence identity to SEQ ID NO 1 or 3, 
wherein the identity is based on the entire coding sequence. 

IV. A polynucleotide having at least 60 % sequence identity to SEQ ID NO 1 or 3, 
wherein the % sequence identity is based on the entire sequence. 

25 

V. A polynucleotide which selectively hybridizes, under stringent conditions and a 
wash in 2 X SSC at 50 °C, to a hybridization probe derivable from the polyn- 
ucleotide sequence as set forth in SEQ ID NO 1 or 3, or from the genomic 
sequence. 

30 

VI. A polynucleotide complementary to a polynucleotide of V. 

VII. The polynucleotide of I, wherein the starch or amylose biosynthesis enhancing 
polynucleotide is from Solanum tuberosum. 

35 

VIII. The polynucleotide of I encoding a polypeptide, which after over-expression in a 
plant cell increases the starch or amylose content. 



IX. 

40 



The polynucleotide of I in antisense orientation, which after expression in a 
plant cell decreases the starch or amylose content. 
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X. A vector comprising at least one polynucleotide of i. 

XL An expression cassette comprising at least one polynucleotide of I operably 

5 linked to a promoter, wherein the polynucleotide is in sense or antisense orien- 
tation. 

XII. A host cell which is introduced with at least one expression cassette of X. 

10 XIII. The host cell of XI that is a plant cell. 

XIV. A transgenic plant comprising at least one expression cassette of XI. 

XV. The transgenic plant of XIII, wherein the plant is Solanum tuberosum. 

XVI. A tuber from the transgenic plant of XIV. 



15 



XVII. An isolated protein comprising a member selected from the group consisting of: 

a) a polypeptide comprising at least 10 contiguous amino acids of SEQ ID 
20 NO 2 or 4, 

b) a polypeptide which is a plant starch biosynthesis enhancing protein, 

c) a polypeptide comprising at least 55 % sequence identity to SEQ ID NO 2 
or 4, wherein the sequence identity is based on the entire sequence and 
has at least one epitope in common with a starch biosynthesis enhancing 

25 protein. 

d) a polypeptide encoded by a polynucleotide selected from SEQ ID MO 1 
or 3, 

e) a polypeptide of SEQ ID NO 2 or 4. 

30 XVIII. The protein of XVII, wherein the polypeptide is catalytically active. 

XIX. A ribonucleic acid sequence encoding the protein of XVIII. 

XX. A method for modulating the level of starch biosynthesis enhancing protein in a 
35 plant, comprising: 

a) stably transforming a plant cell with a polynucleotide coding for a starch 
biosynthesis enhancing protein operably linked to a promoter, wherein the 
polynucleotide is in sense or antisense orientation 
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b) growing the plant cell under plant growing conditions to produce a regen- 
erated plant capable of expressing the polynucleotide for a time sufficient 
to modulate the level of starch biosynthesis enhancing protein in the plant. 

The method of XX, wherein the polynucleotide coding for a starch biosynthesis 
enhancing protein is selected from SEQ-ID NO 1 or 3. 

The method of XX, wherein the plant is Solanum tuberosum. 

The method of XX, wherein activity of the starch biosynthesis enhancing protein 
is increased. 

A method for modulating the level of starch or amylose in a plant, comprising: 

a) stably transforming a plant cell with a polynucleotide coding for a starch 
biosynthesis enhancing protein operably linked to a promoter, wherein the 
polynucleotide is in sense or anti-sense orientation, 

b) growing the plant cell under plant growing conditions to produce a regen- 
erated plant capable of expressing the polynucleotide for a time sufficient 
to modulate level of starch or amylose in the plant. 

20 

XXV. A method for modulating the level of starch or amylose in a plant, comprising: 
a) stably transforming a plant cell with a polynucleotide encoding a starch 

biosynthesis enhancing protein operably linked to a promoter, wherein the 
polynucleotide is in sense or anti-sense orientation. 
25 b ) growing the plant cell under plant growing conditions to produce a regen- 

erated plant capable of expressing the polynucleotide for a time sufficient 
to modulate level of starch or amylose in the plant. 

XXVI. The method of XXIV wherein the polynucleotide coding for a starch biosynthesis 
30 enhancing protein is selected from SEQ ID NO 1 or 3. 

Some of the terms used further on in the specification are defined at this point. 

"Enzymatic activity/activity assay": the term enzymatic activity describes the ability of 
35 an enzyme to convert a substrate into a product. In this context, both the natural sub- 
strate of the enzyme and a synthetic modified analog of the natural substrate can be 
used. The enzymatic activity can be determined in what is known as an activity assay 
via the increase in the product, the decrease in the starting material, the decrease or 
increase in a specific cofactor, or a combination of at least two of the aforementioned 
40 parameters as a function of a defined period of time. 



5 XXI. 

XXII. 
10 XXIII. 

XXIV. 

15 
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"Functional equivalents" in the present context describe nucleic acid sequences which 
hybridize under standard conditions with the nucleic acid sequence encoding the starch 
biosynthesis enhancing protein or portions of the nucleic acid sequence encoding the 
starch biosynthesis enhancing protein, and which are capable of bringing about the 
5 expression of an enzymatically active plant starch biosynthesis enhancing protein in a 
cell or an organism. 

It is advantageous to use short oligonucleotides of a length between 10 to 50bp, pref- 
erably 15-40bp, for example of the conserved or other regions, which can be deter- 

10 mined via comparisons with other related genes in a manner known to the skilled 

worker for the hybridization. Alternatively, it is also possible to use longer fragments of 
the nucleic acids according to the invention or the complete sequences for the hybridi- 
zation. These standard conditions vary depending on the nucleic acid used, namely 
oligonucleotide, longer fragment or complete sequence, or depending on which type of 

15 nucleic acid, that is DNA or RNA, is being used for the hybridization. Thus, for exam- 
ple, the melting temperatures for DNA: DNA hybrids are approx. 10oC lower than those 
of DNA:RNA hybrids of equal length. Suitable hybridization conditions are described 
above. 

20 A functional equivalent is furthermore also understood as meaning, in particular, natu- 
ral or artificial mutations of the relevant nucleic acid sequences of the plant starch 
biosynthesis enhancing protein and their homologs from other organisms which make 
possible the expression of the enzymatically active plant starch biosynthesis enhancing 
protein in a cell or an organism. 

25 

Thus, the scope of the present invention also extends to, for example, those nucleotide 
sequences which are obtained by modification of the nucleic acid sequence of a starch 
biosynthesis enhancing protein. The purpose of such a modification can be, for exam- 
ple, the insertion of further cleavage sites for restriction enzymes, the removal of ex- 
30 cess DNA, or the addition of further sequences. Proteins which are encoded via said 
nucleic acid sequences should still maintain the desired functions, despite the deviating 
nucleic acid sequence. 

The term functional equivalent may also refer to the protein encoded by the nucleic 
35 acid sequence in question. In this case, the term functional equivalent describes a 
protein whose amino acid sequence is up to a specific percentage identical with that 
of he starch biosynthesis enhancing protein. 

Functional equivalents thus encompass naturally occurring variants of the sequences 
40 described herein, and also artificial, for example chemically synthesized, nucleic acid 
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sequences adapted to the codon usage, or the amino acid sequences derived there- 
from. 

In general, it can be said that functional equivalents independently of the amino acid 
5 sequence in question (encoded by a corresponding nucleic acid sequence) have in 
each case the enzymatic activity of a starch biosynthesis enhancing protein. 

"Reporter genes" encode readily quantifiable proteins. Using these genes, an assess- 
ment of transformation efficacy or of the site or time of expression can be made via 

10 growth, fluorescence, chemoluminescence, bioluminescence or resistance assay or via 
photometric measurement (intrinsic color) or enzyme activity. Very especially preferred 
in this context are reporter proteins (Schenborn E, Groskreutz D. Mol. Biotechnol. 
1999; 13(1):29-44) such as the "green fluorescence protein" (GFP) (Gerdes HH and 
Kaether C, FEBS Lett. 1996; 389(1 ):44-47; Chui WLet al., Curr. Biol. 1996, 6:325-330; 

15 Leffel SM et al., Biotechniques. 23(5):912-8, 1997), chloramphenicol acetyl transferase, 
a luciferase (Giacomin, Plant Sci. 1996, 116:59-72; Scikantha, J. Bact. 1996, 178:121; 
Millar et al., Plant Mol. Biol. Rep. 1992 10:324-414), and luciferase genes, in general 
b-galactosidase or b-glucuronidase (Jefferson et al., EMBO J. 1987, 6, 3901-3907), 
the Ura3 gene, the Ilv2 gene, the 2-desoxyglucose-6-phosphate phosphatase gene, 

20 b-lactamase gene, the neomycin phosphotransferase gene, the hygromycin phospho- 
transferase gene, or the BASTA (= gluphosinate) resistance gene. 

"Significant increase": referring to the enzymatic activity, is understood as meaning the 
increase in the enzymatic activity of the enzyme incubated with a candidate compound 
25 in comparison with the activity of an enzyme not incubated with the candidate com- 
pound, which lies outside an error in measurement. 

"Substrate": Substrate is the compound which is recognized by the enzyme in its 
original function and which is converted into a product by means of a reaction cata- 
30 lyzed by the enzyme. 

Preferably, the plant starch biosynthesis enhancing protein is encoded by a nucleic 
acid sequence comprising 

35 a) a nucleic acid sequence shown in SEQ ID NO 1 or 3; or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic code, can 
be deduced from the amino acid sequence shown in SEQ ID NO 2 or 4 by back 
translation; or 

40 
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c) a nucleic acid sequence which, owing to the degeneracy of the genetic code, can 
be deduced from a functional equivalent of the amino acid sequence shown in 
SEQ ID NO 2 or 4, which has an identity with SEQ ID NO 2 or 4 of at least 50%, 
by back translation. 

5 

The functional equivalent of SEQ ID NO 2 or 4 set forth in c) has an identity of at least 
50%, 51%, 52%, 53%, 54%, 55%, 56%, 57% preferably at least 58%, 59%, 60%, 61%, 
62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, and 70% more preferably 71%, 72%, 
73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85% most pref- 
10 erably at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99% identity with the SEQ ID NO 2 or 4. 

Potato varieties used for starch production as well as genotypes with a high amylose 
content are transformed with gene constructs as described and in example 17 for the 

15 over-expression of a starch biosynthesis enhancing protein. The over-expression of 
StGH1 or StGH2 in potato plants will result in an increased starch or amylose content 
of the transgenic plant compared to the starting plant. The increase of starch content in 
the transgenic lines can be seen in table 9. The lines also show an increased harvest 
weight when grown in the greenhouse (table 6) thus resulting in an increased starch 

20 yield. 

Example 1 

Complementation study in yeast 

25 Yeast contains two self-glycosylating proteins, Glglp and Glg2p, which yield primers 
for the initiation of glycogen synthesis. For glycogen synthesis to take place in yeast it 
is required that either gene is functional. Yeast strain CC9, contain knock-out mutations 
for both genes and is therefore a null mutant regarding this specific biosynthetic func- 
tion and is therefore unable to produce glycogen (Cheng, C. et aL, Molecular and 

30 Cellular Biology (1995), 6632-6640). CC9 was used as a basis for complementation 
experiments with the isolated potato genes in order to validate their function by restor- 
ing glycogen biosynthesis in the CC9 strain. The potato genes were cloned in a yeast 
plasmid, pRS414 (Stratagene), and expressed with various yeast controlling elements 
such as GaM , Adh1 and Glg2p promoters. CC9 was transformed by the resulting plas- 

35 mids using LiCI and electroporation (Multiporator, Eppendorf). Transformed yeast 
colonies growing on appropriate media plates were screened by immersing in iodine 
solution. Wild type yeast producing glycogen is stained red brown by iodine while the 
null mutant CC9 is not stained. CC9 expressing the potato genes,-StGH1 and StGH2, 
will stain red brown, when the isolated genes complement a glycogenin function in 

40 yeast and thus carry the desired function. 
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Example 2 

Transformation method 

5 Fully expanded leaves from in vitro propagated potato plants are diagonally cut in 
2-4 pieces and precultivated on MC-plates for 2-3 days at 23-24°C. 

Agrobacterium tumefaciens strain LBA4404 containing pHS1, pHS2, pHS3 pHS4, 
PHASHS2, pHASHS4, pHASHS5, pHASHS6, pHASHS7 or pHASHS8 are grown in 
1 0 YEB medium with 100|jg rifampicin and 25pg/ml kanamycin over night on constant 
shaking (200 rpm) at 28°C. 

The Agrobacterium culture is prepared for infection by dilution 1 :20 with MS10 medium. 
The leaf explants are infected for 8-10 min in the bacterial solution and afterwards 

15 drained on filter paper for 5-20 seconds. The leaf segments are placed on the MS300 
plates for 2 days co-cultivation under modest light at 23-24°C. At the end of co-culti- 
ation the leaf segments are moved to M400 plates containing 400 g/l Claforan to 
suppress bacterial growth. After 4-5 days the explants are moved to selection medium 
MS400 supplemented with 400 g/l Claforan. For explants transformed with pHS1 and 

20 pHS2 50pM kanamycin was included in the media and for explants transformed with 
pHS3, pHS4, pHASHS2, pHASHS4, pHASHSS, pHASHS6, pHASHS7 and pHASHS8 
0.5 M Imazamox was added to the media. 

Leaf segments are transferred to fresh MS 400 selection medium every fortnight. The 
regenerated putative transgenic shoots are collected and cultivated on MS30 plates 
25 with 200 g/l Claforan aiming at shoot elongation. 



When the shoots are 3-5 cm long, 1-2 cm are cut off and grown on microtuber medium 
in the dark at 25°C. After 2-5 weeks microtubers are produced. 



MC plates 


MS300 


MS300 plates with 1 .5-2 ml 


4.4 g/l MS-medium 


liquid MS100 medium and 


2 mg/l naphthyl acetic acid 


covered with one sterile 


1 mg/l 6-benzyl amino pyridine 


filter paper 


3% (w/v) sucrose 




pH 5.2 
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MS10 


MS400 


4.4 g/l MS-medium (murashige and 
Skoog) 

1 % (w/v) sucrose 
pH 5.8 


4.4 g/l MS-medium 
2 g/l zeatine 

0.01 mg/I naphthyl acetic acid 

0 1 mn/l aihhprpllir fir id 
10% (w/v) sucrose 
400 mg/I claforan 

0.5 pM Imazamox or 50 pM kanamycin 
pH .8 


MS30 


Microtuber medium 


4.4 g/l MS-medium 
3% (w/v) sucrose 
pH 5.8 


4 4 a/I MS-mpriinm 
2.5 mg/I kinetin 
0.5 mg/I abscisic acid 
8% sucrose 
200 mg/claforan 


MS100 




4.4 mg/I MS-medium 
30 /I sucrose 
0.5 m/lg thiamin-HCl 
0.5 mg/I pyridoxin-HCI 
1 mg/I nicotinacid 
0.5 mg/I kinetin 

29.S mg/I ferrous sulfate hepta hydrate 

1 mg/I 2,4-Dichlorophenoxyacetic acid 

2 g/l caseinhydrolysate 
pH 5.2 




Example 3 

Transgenic plant AM 99-2003 



5 High amylose potato lines can be produced for example by using antisense, RNAi or 
antibody technology that target the two starch branching enzymes starch branching 
enzyme 1 (SBE1 ) and starch branching enzyme 2 (SBE2). 

The high amylose potato line AM99-2003 is produced by inhibition of the starch 
10 branching enzyme activities in the parental line Dinamo. Transformation is made with a 
construct of SBE1 and SBE2 in antisense orientation driven by the gbss promoter. 
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pBluescript containing a 1620bp fragment of the 3'end of Sbe1 between EcoRV and 
Spel is cut open with Spel (blunt) and Xbal and ligated with a 1243bp Sstl (blunt) and 
Xbal fragment of the 3'end of Sbe2. The Sbe2 and Sbe1 complex is cut out with 
5 EcoRV and Xbal and ligated to the Smal and Xbal opened up binary vector pHo3.1 , 
see figure 8. The final vector is named pHAbe12A, see figure 9 and nucleic acid 
sequence SEQ ID NO 15. pHo3.1 is based on pGPTVKan (Becker, D. et al., Plant 
Molecular Biology 20 (1992), 1195-1197) with the addition of the 987bp gbss promoter 
cloned at the Hindlll site of pGPTVKan and the uidA gene is deleted by Smal and Sstl. 

10 

The parental line Dinamo is transformed with the construct pHAbe12A as described in 
example 2. 

Example 4 

15 Down-regulation of StGH1 and StGH2 genes in potato by antisense 

The StGH1 and StGH2 genes were down-regulated in potato by transformation with 
the genes in antisense direction in relation to a plant regulatory element. The respec- 
tive antisense genes were cloned in a binary vector driven by a tuber specific gbss 
20 promoter. Nptll, yielding resistance to the antibiotic kanamycin, was used as selection 
marker. Two varieties were transformed, Prevalent and Producent. The shoots were 
selected on 50 pM kanamycin, which is a standard kanamycin concentration used for 
potato transformation (Ooms, G et al., Theoretical and Applied Genetics 73:744-750 
(1987) and Tavazza, R. et al., Plant Science 59 (1988), 175-181). 

25 

Example 5 

Over-expression of StGH1 and StGH2 genes in potato 

The StGH1 and StGH2 genes were over-expressed in potato driven by the tuber 
30 specific promoter gbss. A mutated AHAS gene was used as selection marker yielding 
tolerance to the Imazamox herbicides. Two potato varieties were transformed, Desiree 
and AM99-2003 a transgenic high amylose line with a 40% decrease in starch content 
compared to its parental line. 

35 Example 6 

Selection of transgenic lines 

Non-transgenic escapes were identified and discarded by a PGR screening method. 
DNA was extracted according to DNeasy 96 Plant protocol (Qiagen). In a 96 well 
40 microtiter plate, 10-15 mg leaf tissue was added to each well together with a 5mm steel 
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ball, each well then representing one individual shoot. The plates were frozen in N 2 (l) 
before homogenisation. The homogenisation was done at 30Hz in a Mixermill300 for 
1 min. The DNA was at the end of the extraction protocol eluted in 75ul H 2 0. 

5 Specific primers for nptll and AHAS were used for the amplification of a 246bp frag- 
ment respective a fragment of 509 bp for selection of successfully transformed lines. 

Npt2_f or 5 '-AGCAAGGTG AGATG ACAGG AG ATC-3 ' 
Npt2_rev 5 'CAGACAATCGGCTGCTCTG ATG-3 ' 

0 

AHAS1_frw: 5 '-AACAACAACATCTTCTTCG ATC-3 ' 
AHAS1_rev: 5 '-TAACGAGATTTGTAGCTCCG-3 '. 

The PCR reactions were with the extracted DNA setup and run as follow: 

5 

Reaction: 



10x PCR Mix 


2,0 pi 


Primer frw (25uM) 


0,4 (Jl 


Primer rev (25uM) 


0,4 Ml 


dNTPs (10mM) 


0,4 Ml 


RedTAQ (Sigma) 


1.0 M" 


Templat (~20ng/ul) 


4,0 Ml 


H 2 0 


11,8 Ml 


PCR program: 




94°C 30 s 




59°C 30 s x29 cycles 


72°C 30s 




72°C 7 min 




8°C old 





Example 7 

Gene expression analysis 

The gene expression levels of the StGH1 and StGH2 genes were analysed in the 
transgenic potato lines with real-time PCR (ABI prism 7900HT, Applied Biosystems). 
With real-time PCR the change of gene expression can be analysed regarding RNA 
expression levels. For pHS1 and pHS2 transgenic lines, expression of both sense and 
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antisense RNA of StGH1 and StGH2 was measured, while in pHS3 and pHS4 trans- 
genic lines the change in StGH1 and StGH2 mRNA expression was analysed. 

The target for pHS1 and pHS2 is to reduce transcript levels of StGH1 and StGH2 
5 respectively while the target for pHS3 and pHS4 is to increase transcript levels of the 
respective genes. 

RNA was isolated from microtubers of the transgenic potato lines and mother varieties 
using Invisorb Spin Plant-RNA mini kit (Invitek). A reverse transcription reaction was 
10 made with 250 ng total RNA in 25|jl total reaction volume using TaqMan reverse tran- 
scription reagents (Applied Biosystems). Separate and specific primers (see table 1) 
were designed and used for the reverse transcription reaction in order to be able to 
differentiate the endogenous expression from the antisense RNA expression of the 
respective genes. 



StGH1 sense RNA 


5'-TGAAGACAGCACAAAACTGG-3' 


StGH1 antisense RNA 


5 '-GTGAAAGTTTGA ACGCACAC-3 ' 


StGH2 sense RNA 


5 '-AGTGCCATAACATGCTTTCC -3' 


StGH2 antisense RNA 


5 '-CACATTTCAGCTGTTG ATGG A-3 ' 



Table 1 



5 ul of the reverse transcription reaction was used in triplicate analyses together with 
20 specific sequence detection primers, TaqMan MGB probe (see table 2) and UMM 
mastermix (Applied Biosystems) and determined with real-time PCR according to the 
suppliers instructions. 



StGH1 

Forward Primer: TCG AGTCG CC ACGTAG AACTC 
Reverse primer: GAAATGCGTATGCGACTATGATG 
TaqMan probe: AGTCTCTCG G AGTTCC 

StGH2 

Forward primer: GGTGCTGATCCTCCAGTTCTCT 
Reverse primer: GTCCCTG AAGCATAACC A AG GT 
TaqMan probe: TTCTGCACTACTTAGGCCT 



25 Table 2 
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Down-regulation of the two genes resulted in a decrease in gene expression in trans- 
genic lines compared to their mother varieties in the order of 50-95 %. 



Over-expression of the two genes resulted in a 2-10 times increase in gene expression 
5 in transgenic lines compared to their mother varieties. 



Line No. 


oonsiruci 


vanexy 


A r*t e 
A OX S 


Times increase or decrease 

in gene expression 
comparea xo paremai line 


rUl-U*rl-o*t 


n UC1 


Producent 


A A A 

-1,14 


-1 ,o 




n MC1 


Producent 


o no 
-2,03 


-4,1 




-UC1 

phbl 


Producent 


-1 ,47 




PU2-325-11 


pHSl 


Producent 


-1,25 


-1,6 


PU2-325-15 


pHSl 


Producent 


-2,52 


-6,3 


P02-325-25 


pHSl 


Producent 


-2,0 


-4 


P02-325-27 


pHSl 


Producent 


-1 ,64 


-2,7 


P02-325-33 


pHSl 


Producent 


-1,59 


-2,5 


P02-325-34 


pHSl 


Producent 


-1,52 


-2,3 


PQ2-325-63 


pHSl 


Producent 


-1,53 


-2,3 


P02-300-37 


pHS2 


Prevalent 


-1,27 


-1,6 


PU^-jUU-OO 


pnbZ 


Prevalent 


-1,04 


-1 ,1 






rTevaieni 


-1 1Q 
-I , IO 


-1 ^ 

i ,o 






1 f CVCUCI ll 


-1 1 


-1 2 


P02-300-80 


DHS2 


Prevalent 


-2 12 


-4 5 


P02-300-127 


pHS2 


Prevalent 


-1 ,67 


-2,8 

to|W 


PQ2-3Q0-14Q 


., PHS2 


Prevalent 


-3,96 


-15,7 


P02-303-31 


pHS2 


Prevalent 


-1,16 


-1,4 


P02-303-64 


pHS2 


Prevalent 


-1,15 


-1,3 


P02-305-54 


pHS2 


Prevalent 


-1,33 


-1,8 


P02-320-24 


pHS2 


Prevalent 


-1,03 


-1.1 


P02-307-4 


pHS3 


Desiree 


1,82 


3,3 


P02-307-5 


pHS3 


Desiree 


2,68 


7,2 


P02-307-12 


pHS3 


Desiree 


2,67 


7,1 


P02-307-14 


pHS3 


Desiree 


1,83 


3,3 


P02-307-15 


pHS3 


Desiree 


1,79 


3,2 


P02-307-33 


pHS3 


Desiree 


3,21 


10,3 


P02-307-43 


pHS3 


Desiree 


2,7 


7,3 


P02-307-51 


pHS3 


Desiree 


2,73 


7,5 


P02-307-80 


pHS3 


Desiree 


2,78 


7.7 


P02-307-87 


pHS3 


Desiree 


1,02 


1,1 


P02-307-148 


pHS3 


Desiree 


1,88 


3,5 
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Line No. 


Construct 


Variety 


A Ot S 


Times increase or decrease 
in gene expression 
compared to parental line 




pHS3 


AM99-2Q03 


1,64 


2,7 


Dfo ono 111 


pH53 


AM99-2003 


1,34 


1,8 




{Jnjj 




1 ,/D 




P02-311-59 


pHS3 


AM99-2003 


1.17 


1,4 


P02-3 12-15 


pHS4 


AM99-2003 


1,03 


1.1 


P02-313-21 


pHS4 


AM99-2003 


1,54 


2,4 


P02-317-2 


pHS4 


AM99-2003 


1,2 


1,4 



Table 3: Gene expression analysis based on Real- Time PGR 



Example 8 
5 Dry matter analysis 

Dry matter has been analyzed on microtubers from transgenic lines transformed with 
pHS1, pHS2, pHS3 and pHS4 showing a down-regulation or over expression of the 
genes. Since starch normally contribute to more than 80% of the dry matter in potato 
10 tubers, an increase or decrease in starch content will affect also the dry matter content 

Two microtubers of each line were harvested when they had reached maturity. Dry 
matter was calculated for mature microtubers weighed before and after 72 hours drying 
at 60°C. For comparison microtubers from the varieties Dinamo, Desiree, Prevalent, 

15 Producent and P737 with starch contents between 13 and 28% (when grown in field) 
were used. The starch content of microtubers is not as high as starch content of field 
grown tubers. However dry matter content can readily be compared in microtubers and 
that value is well correlated to the determined starch content in field grown tubers. In 
table 4 the average dry matter for the different varieties, calculated on ten or more 

20 microtubers, is shown. 



Variety 


Starch content field 
grown tubers 


Dry matter 
microtubers 


AM99-2003 


13% 


14,8 


Desiree 


I 16% 


16,1 


Producent 


22% 


19,2 


Prevalent 


22% 


19,7 


P737 


28% 


21,6 



Table 4: Dry matter content of 5 varieties based on 10 or more microtubers 
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One of each pHS1 and pHS2 with confirmed decrease in gene-expression have been 
analyzed for dry matter so far. Those two have a decrease in dry matter of 7 and 1 1 % 
compared to their mother varieties. 
5 For the pHS3 lines 8 of 9 of the confirmed over-expressed lines show an increase of up 
to 36% in dry matter. See table 5. 



Line No. 


Construct 


Variety 


Dry matter in relation to parental 
line (%) 


41-84 


pHS1 


Producent 


89 


300-127 


pHS2 


Prevalent 


93 


300-140 


PHS2 


Prevalent 


96 


307-4 


pHS3 


Desiree 


106 


307-5 


pHS3 


Desiree 


117 


307-15 


pHS3 


Desiree 


124 


307-33 


pHS3 [ 


Desiree 


116 


307-57 


pHS3 


Desiree 


136 


309-63 


pHS3 


AM99-2003 


134 


309-106 


pHS3 


AM99-2003 


109 


309-111 


pHS3 


AM99-2003 


108 



Table 5: Dry matter content on transgenic lines with confirmed 
10 down-regulation or over-expression of the StGH1 and StGH2 genes 

Example 9 

Starch content analysis 

15 For analysis of starch content a total starch assay procedure from Megazyme Inter- 
ational Ireland Ltd., Bray, Co.Wicklow, Ireland ( AOAC Method 996.1; AACC method 
76.13; ICC standard method No. 168) was used according to the suppliers instructions. 

Starch content was analysed on microtubers from all transgenic lines transformed 
20 with pHSl , pHS2, pHS3 and pHS4. The microtubers were harvested when they had 
reached maturity. Mature microtubers were ground and maltosaccharides and free 
glucose residues were washed away with ethanol. The microtuber starch was treated 
with DMSO to ensure the complete solubilisation of samples with high levels of resis- 
tant starch, as the high amylose clones. 

25 



Samples were analyzed with a standard spectrophotometric assay procedure. The 
transgenic lines were compared to potato varieties with known starch content ranging 
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from 8% to 30%. The results give an indication on the change in starch content related 
to the genetic modification of the different transgenic lines. 

Example 10 
5 Greenhouse trial 

Harvest weight and dry matter content was measured on pHS3 and pHS4 transgenic 
lines and their mother varieties grown in the greenhouse. The harvest weight from 10 
greenhouse grown pots was measured, see table 6. 



Line No. 


Construct 


Parental 
line 


1 ICll VCdl 


increase in narvesi 
wtJiyni wUinjJcircLi 10 

narontol lino 
pdltJilldl 111 IV 

/o 


AM99-2003 






1150 

1 1 WW 




Desiree 






1500 

1 www 




P02-307-4 


pHS3 


Desiree 


1Q00 

1 www 


c / 


P02-307-5 


pHS3 


Desiree 

t^r WWII VC 


2050 


o / 


P02-307-12 


pHS3 


Desiree 

fc^ WWII WW 


2250 

fc. C w w 


^n 


P02-307-15 


pHS3 


Desiree 

L/COII ww 


1Q50 

1 W V? \J 


ou 


P02-307-33 


pHS3 


Desiree 


1950 


30 


P02-307-43 


pHS3 


Desiree 


2000 


33 


P02-307-51 


pHS3 


Desiree 


1950 


30 


P02-307-80 


pHS3 


Desiree 


1750 


17 


P02-309-63 


pHS3 


AM99-2003 


1550 


35 j 


P02-309-114 


pHS3 


AM99-2003 


1250 


9 


P02-316-25 


pHSS 


AM99-2003 


1450 


26 


P02-316-111 


pHS3 


AM99-2Q03 


1300 


13 


P02-314-4 


pHS4 


Desiree 


1650 


10 


P02-314-15 


pHS4 


Desiree 


1600 


7 


P02-314-35 


pHS4 


Desiree 


1850 


23 


P02-314-40 


pHS4 


Desiree 


1700 


13 


P02-312-15 


pHS4 


AM99-2003 


1350 


17 


P02-313-21 


pHS4 


AM99-2003 


1300 


13 


P02-313-42 


pHS4 


AM99-2003 


1650 


43 


P02-317-2 


pHS4 


AM99-2003 


1350 


17 



Table 6: Greenhouse harvest weight of lines over-expressing StGH1 (pHS3) or StGH2 
(pHS4). 
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The harvest weight was increased up to 43% in the transgenic lines compared to their 
mother varieties. The results show that over-expression of StGH1 and StGH2 results in 
an increase in total harvest weight. 

5 The lines grown in the greenhouse were analyzed for dry matter content. Slices of 
three tubers of each line were dried in a freeze dryer for 72 hours with weighing prior 
and after the drying. The dry matter results can be seen in table 7 and present a mean 
value of the three analyses. 



Line i\io. 


oonsiruci 


raremai line 


Mean vaiue 


Increase in dry matter 
in comparison with 
parental line 

0/ 

TO 


AMQQ OfifiQ 

mivi y y-^uuo 






iy, i o 




uesiree 






19,25 






pnoo 


Desiree 


21 ,29 


1 1 




pnoo 


Desiree 


19,92 


o 
3 


□no o/V7 i o 


-LJCO 

pnoo 


Desiree 


22,4o 


1 / 


□no Of\~7 H A 

KU^-oU/- 14 


pnoo 


Desiree 


20,45 


6 




pnoo 


Desiree 


22,12 


15 


rut ou/ oo 


pnoo 


U65ire@ 


I ,uo 


Q 


P02-307-43 


pHS3 


Desiree 


19,68 


2 


P02-307-80 


pHS3 


Desiree 


19,93 


4 


P02-309-63 


pHS3 


AM99-2003 


21,24 


11 


P02-309-111 


pHS3 


AM99-2003 


23,44 


22 


P02-309-114 


pHS3 


AM99-2003 


22,55 


18 


PQ2-311-59 


pHS3 


AM99-2003 


21,83 


14 


P02-316-25 


pHSS 


AM99-2003 


19,60 


2 


P02-316-111 


pHS3 


AM99-2003 


24,18 


26 


P02-318-12 


pHS3 


AM99-2003 


23,27 


22 


P02-314-4 


pHS4 


Desiree 


19,77 


3 


P02-313-21 


pHS4 


AM99-2003 


21,08 


10 


P02-313-42 


pHS4 


AM99-2003 


20,61 


8 


P02-317-2 


pHS4 


AM99-2003 


20,76 


8 


P02-317-15 


pHS4 


AM99-2003 


19,45 


2 



10 

Table 7: Analysis of dry matter content in lines over-expressing StGH1 (pHS3) or 
StGH2 (pHS4) grown in field trial. 

Analyses of the transgenic lines over expressing the StGH1 or StGH2 genes show an 
15 increase in dry matter content compared to its respective parental line. As can be seen 
in table 7 the dry matter is increased up to 26% in lines over-expressing the StGH1 
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gene. The increase in dry matter is more pronounced when AM99-2003 is used as 
parental line. This is due to the fact that AM99-2003 is containing a significant amount 
of available sugars as a consequence of the high amylose trait (see figure 2). Also in 
lines over-expressing StGH2 the dry matter is increased. The increase is also in his 
5 case higher when AM99-2003 is used as parental line. Lines over-expressing StGH2 
have an increase in dry matter of up to 10%. 

Example 1 1 

Field-trial of transgenic potato lines 

10 

Transgenic lines as described in examples 5 to 10 are tested in field trials for the 
determination of agronomic performance in relation to the parental line and other varie- 
ties used for starch production. Starch content, which is a main agronomic factor of 
importance for crops used for starch processing, can be measured by several 
1 5 different methods. 

Under water weighing of tubers is performed on a scale in a tub of water. Starch 
content was determined according to standard procedure. 5 kg potato is used for 
the measurement and starch content is calculated according to the formula: Starch 
20 content in % = (density of potato -1 .01 506) / 0.0046051 . An increase in starch content 
is associated with an increase in the density of the sample. An increase of starch in the 
tubers is associated with an increased dry matter content, which can be measured by 
comparing the tissue fresh weight to tissue dry weight after extensive water elimination 
in an oven at 105°C for 16 hours. 

25 

Starch content can also be measured by enzymatic methods as described under starch 
content analysis in example 9 and 12. 

Example 12 
30 Results in field-trial 

Five lines over-expressing StGH1 or StGH2 were grown in the field as cuttings. The 
growth period was within June to September. After harvest the lines where analyzed 
for dry matter content, starch content and sugar content, see results in tables 8-10. 

35 

The dry matter content was analyzed by drying 15 g of mashed potatoes (produced 
in a blender) in a fanned heating oven for 16-18 hours at 105°C. The samples were 
cooled down to room temperature in an exicator before measurement. 
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The starch content from field grown tubers was analyzed according to an enzymatic 
method described in P. Aman et al. Methods in Carbohydrate Chemistry Vol. X. 1994, 
pp.1 1 1-115 by using a thermostable ot-arnylase. Duplicate analysis was made on 
ground and dried samples of tubers diluted in ethanol (80%) and digested by thermo- 
5 stable oc-amylase and amyloglucosidase. The amount of starch was determined by a 
glucose oxidase reaction. 

The concentration of fructose, glucose and sucrose was determined using gas-liquid 
chromatography by methods described by Georg Fuchs et al., Swedish J. Agric. 
10 Res. 4:49-52, 1974, Quantitative determination of low-molecular carbohydrates in 
foods by gas-liquid chromatography. 



Line No. 


Construct 


parental 
line 


Dry matter 

% 


Increase in 
dry matter 

% 


AM99-2003 






20,7 




Desiree 






21,8 




P02-307-33 


pHS3 


Desiree 


22,6 


4 


P02-307-80 


pHS3 


Desiree 


22,3 


2 


P02-309-63 


pHS3 


AM99-2003 


21,9 


6 


P02-309-106 


pHS3 


AM99-2003 


21,2 


2 


P02-313-21 


pHS4 


AM99-2003 


22,2 


7 



Table 8: Analysis of dry matter content in lines over-expressing StGH1 or StGH2 
15 grown in field-trials. Results presented are mean values of two analyses. 

As can be seen in table 8 the field grown transgenic lines over-expressing StGH1 or 
StGH2 show an increase in dry matter of up to 7%. The lines also show an increase in 
starch content as can be seen in table 9. The highest increase can be seen for lines 
20 with AM99-2003 as parental line. This is due to the access of sugars available in the 
high amylose parental line (figure 2). 
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Line No. 


Construct 


parental 
line 


Starch con- 
tent 

% 


Increase of 
starch con- 
tent 

% 


AM99-2003 






13,3 




Desiree 






16,7 




P02-307-33 


pHS3 


Desiree 


17,5 ; 


5 


PO2-309-63 


pHS3 


AM99-2003 


14,6 


10 


P02-309-106 


pHS3 


AM99-2003 


14,9 


12 


P02-313-21 


pHS4 


AM99-2003 


15,4 


16 



Table 9: Analysis of starch content determined by an enzymatic and a gravimetric 
method. The lines over-expressing StGH1 (pHS3) or StGH2 (pHS4) were 
grown in field-trials. 

5 

Furthermore the sugar concentrations were analyzed in lines with AM99-2003 as 
parental line. AM99-2003 contains a high fraction of available sugars due to the high 
amylose trait. In the lines over-expressing the StGH1 or the StGH2 gene the concen- 
tration of glucose has been reduced to 1/3 and sucrose has been reduced to 3 / 4 of the 
10 amount analyzed in the parental line (see table 10). The lower sugar concentrations in 
the lines over-expressing the StGH1 or StGH2 gene show that more glucose and 
sucrose has been incorporated in the starch biosynthesis resulting in an increase in 
starch content in these transgenic lines. 



Line No. 


Construct 


parental 
line 


Fructose 
% of DM 


Glucose 
% of DM 


Decrease in 
glucose 
content 

% 


Sucrose 
% of DM 


Decrease in 
sucrose 
content 

% 


AM99-2003 






0,01 


1,2 




3,25 




PO2-309-63 


pHS3 


AM99-2003 


0,02 


0,91 


-24 


2,74 


-16 


P02-309-106 


pHS3 


AM99-2003 


0,01 


0,62 


-48 


2,56 


-21 


P02-313-21 


pHS4 


AM99-2003 


0,01 


0,39 


-68 


2,39 


-26 



15 ' — — 

Table 10: Analysis of fructose, sucrose and glucose content in lines over-expressing 
StGH1 (pHS3) or StGH2 (pHS4) grown in field trial. DM = dry matter. 



WO 2004/078983 PCT/EP2004/002096 

45 

Example 1 3 

Microscopic investigation of lines over-expressing the StGH1 or the StGH2 gene. 

Field grown tubers of transgenic lines over-expressing the StGH1 or the StGH2 gene 
5 with AM99-2003 as parental line were investigated for starch granule morphology 
by staining starch with iodine (Lugol's solution (6.7 g/l Kl +3.3 g/l l 2 ) and glycerol 1:1). 
A piece of a tuber was crushed and a few drops of iodine solution were added. The 
starch granule structure was analyzed under the microscope. 

As can be seen in the figure 10, the starch granules are collapsed towards the interior 
10 of the granule in the high amylose parental variety AM99-2003. In contrast to this, the 
starch granules from the transgenic lines over-expressing the StGH1. or the StGH2 
gene are - see figures 1 1 and 12 - larger and rounded in shape. This is due to the 
increased starch incorporation in the granules as a result of the over-expression of the 
StGH1 or the StGH2 gene. 

15 

Example 14 

Combined expression of genes related to the starch initiation 

StGH1 and StGH2 can be combined in different ways. The genes can be combined 
20 on the same T-DNA or be located on separate T-DNAs. The genes can be used for 
co-transformation or be combined by crossing of transgenic lines. 

Example 1 5 

Combined constructs for inhibition of SBE1 and SBE2 and over-expression of StGHl 
25 or StGH2 

Constructs were made for production of high amylose lines containing high starch 
content. The constructs pHASHS2, pHASHS4, pHASHSS and pHASHSS were made 
for over-expression of StGHl or StGH2. All pHASHS constructs also contain fragments 

30 of be1 and be2 for down-regulation of respective genes with the antisense technique or 
RNA interference technique (RNAi). The down-regulation of the SBE1 and SBE2 genes 
inhibits the amylopectin biosythesis and directs the starch biosythesis towards in- 
creased amylose production. All constructs are based on the binary vector pSUNA- 
HASmodb. The RNAi constructs are based on vector pHAS8b and the antisense con- 

35 structs are based on vector pHAS4b. 

Plants transferred with pHASHS2, pHASHS4, pHASHSS or pHASHS6, yielded high- 
arnylose lines. The starch content in the produced transgenic lines was higher than in 
high-amylose lines not over-expressing the StGHl or StGH2 gene. The starch content 
40 was in the same range as can be seen for pHS3 and pHS4 lines described above. 
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Example 16 

Combined constructs for inhibition of SBE1 and SBE2 and over-expression of StGH1 
and StGH2 

5 

Constructs were made for production of high amylose lines containing high starch 
content. pHASHS7 and pHASHS8 were made for over-expression of StGH1 and 
StGH2 together in one plant. All pHASHS constructs also contain fragments of SBE1 
and SBE2 for down-regulation of respective genes with the antisense technique or 
10 RNA interference technique (RNAi). The down-regulation of the SBE1 and SBE2 genes 
inhibits the amylopectin biosythesis and directs the starch biosythesis towards in- 
creased amylose production. All constructs are based on the binary vector pSUNA- 
HASmodb. The RNAi constructs are based on vector pHAS8b and the antisense con- 
structs are based on vector pHAS4b. 

15 

Plants transformed with pHASHS7 and pHASHS8 yielded high-amylose lines. The 
starch content in the produced transgenic lines was higher than in high-amylose lines 
not over-expressing the StGH1 and StGH2 genes together. 

20 Example 17 

Vector Constructions 

Construction of pSUNAHASmodb 

25 A binary vector based on pSUN1 with a mutated AHAS gene as selection marker was 
constructed. The vector was used for further cloning of trait genes. 

A 608 bp fragment containing the nos promoter was cut out from pGPTV-kan with Hind 
III and Bglll and was ligated to pUC19 (Invitrogen) cut open with Hindlll and BamHI. 

30 The nos terminator (275bp) was cut out from pGPTV-kan with Sstl and EcoRI and 

ligated to above between the Sstl and EcoRI sites. The AHAS gene (S653N) described 
by Sathasivan et al (1991) was optimised by elimination of the restriction sites Hindlll, 
EcoRV, BamHI, EcoRI and Sstl by using QuikChange Multi Site directed Mutagenesis 
kit (Stratagene). Additional restriction sites, Kpnl and Sstl was added at the 3 and 5' 

35 ends of the gene. The gene was named AtAHASmod (figure 22, SEQ ID NO 16). 
AtAHASmod was cut with Kpnl and Sstl (ca 201 9bp) and ligated between the nos 
promoter and nos terminator at Kpnl-Sstl. The above complex was cut out from pUC19 
with Hindlll (blunt) and EcoRI (2900bp) and ligated to pSUN1 at EcoRI and Smal. The 
vector was named pSUNAHASmodb, see figure 19. 
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Construction of pHAS8b 



An RNAi construct pHAS8b with a be1 and be2 fragment (SEQ ID NO 20) for down- 
regulation of the branching enzyme genes was constructed in the binary vector pSU- 
5 NAHASmodb (figure 19) based on pSUN1 with a mutated AHAS gene (SEQ ID NO 16) 
as selection marker. As spacer a fragment of the be2 promoter was used (SEQ ID NO 
18). The vector was used for extended cloning with the StGH1 (SEQ ID NO 1) and 
StGH2 (SEQ ID NO 3) genes. 

10 A 400 bp synthetically produced fragment of be2 (200bp) and be1 (200bp) in 

pBluescript named RNAi420be2be1 (SEQ ID NO 19) was opened with Hindlll (blunt) 
and Sail (3331 bp). A 262bp fragment of the be2 promoter (SEQ ID NO 18), for use as 
spacer, was digested with Bglll (blunt) and Sail and ligated to the vector and named 
pMA17. 

15 

Again the 400 bp RNAi420be2be1 fragment was used and ligated in inverted direction 
to pMA17 opened with Xhol (blunt) and Kpnl (3582 bp). The construct was named 
pMA18. pMA18 was digested with Spel and Kpnl (1 120 bp) and the fragment was 
ligated between a gbss promoter and a nos terminator in pUC19 at Xbal and Kpnl 
20 (3924 bp). The construct was named pMA19b. pMA19b was digested with Pvull and 
Hindlll (2390 bp) and ligated to the binary vector pSUNAHASmodb (figure 19) between 
Sphl (blunt) and Hindlll (8932 bp). The construct was named pHAS8b, see figure 20. 

Construction of pHAS4b 

25 

A vector pHAS4b with an antisense fragment for down-regulation of SEE1 and SBE2 
was constructed in the binary vector pSU NAHASmodb (figure 19) based on pSUN1 
with a mutated AHAS gene (SEQ ID NO. 16) as selection marker. The vector was used 
for extended cloning of the StGH1 and the StGH2 genes. 

30 

The antisense fragment of be1 and be2 together with gbss promoter and nos termina- 
tor was cut out of pHAbe12A (figure 9, SEQ ID NO 15) with BsrBI and Hindlll (4299bp) 
and ligated to pSUNAHASmodb (figure 19) digested with Sphl (blunt) and Hindlll 
(8932bp). The construct was named pHAS4b, see figure 21 . 

35 

Construction of pHASHS5 

The StGH1 gene, the gbss promoter and the nos terminator was digested from pHS3 
(figure 6) and cloned into the RNAi construct pHAS8b (figure 20) containing fragment 
40 of be1 and be2. 
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The StGH1 gene, gbss promoter and nos terminator was cut out from pHS3 with Dral 
and EcoRV (3160bp). The fragment was ligated to pHAS8b opened up with EcoRV 
(1 1403bp). The construct was named pHASHS5, see figure 15. 

5 

Construction of pHASHS6 

The StGH2 gene, the gbss promoter and the nos terminator was digested from pHS4 
(figure 7) and cloned into the RNAi construct pHAS8b (figure 20) containing fragments 
10 ofbe1andbe2. 

pHS4 was digested with Spel and EcoRI. Two fragments were collected for further 
cloning, a 2486bp EcoRI-EcoRI fragment and a 1169bp Spel-EcoRI fragment. 
pBluescript was digested with Spel and EcoRI. The digested pBluescript was ligated 
1 5 with the 1 1 69bp (Spel-EcoRI) fragment. The construct was named pMA1 5. 

pMA15 was digested with EcoRI (4127bp) and ligated to the 2486bp EcoRI-EcoRI 
fragment from pHS4.The construct was named pMA1 6. A 3689bp fragment was cut 
out rom pMA16 digested with EcoRV and ligated to pHAS8b opened with EvoRV 
(1 1403bp). The construct was named pHASHSS, see figure 16. 

20 

Construction of pHASHS2 

The StGH1 gene, the gbss promoter and the nos terminator was digested from pHS3 
(figure 6) and cloned into pHAS4b (figure 21) containing an antisense fragment of be1 
25 and be2. 

pHS3 was digested with Dral and EcoRV (3160bp) and ligated to pHAS4b opened up 
with EcoRV (13224bp). The construct was named pHASHS2, see figure 13. 

30 Construction of pHASHS4 

The StGH2 gene, the gbss promoter and the nos terminator was digested from pHS4 
(figure 7) and cloned into pHAS4b (figure 21) containing an antisense fragment of be1 
and be2. 

35 

A 3689bp fragment was digested from pMA16 with EcoRV (661 3bp) (for pMA16 see 
construction strategy of pHASHS3). pHAS4b was digested with EvoRV (13224bp) and 
ligated with the above fragment. The construct was named pHASHS4, see figure 14. 
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Construction of pHASHS7 

pHASHS7 was designed to contain antisense fragments of be1 and be2 for inhibition of 
respective gene together with the two amylose biosynthesis enhancing genes StGH1 
5 and StGH2. 

pMA16 (for pMA16 see construction strategy of pHASHS3) was digested with EcoRV. 
The resulting 3649bp fragment was ligated to pHASHS2 (figure 5) opened with Pstl 
(blunted). The construct was named pHASHS7, see figure 1 7. 

10 

Construction of pHASHS8 

pHASHS8 was designed to contain a fragment of be1 and be2 for inhibition of respec- 
tive gene using RNAi (SEQ ID NO 1 9 to 22) and linked by a spacer (SEQ ID NO 18 or 
1 5 23) together with two amylose biosynthesis enhancing genes StGH1 and StGH2. 

pMA1 6 (for pMA1 6 see construction strategy of pHASHS3) was digested with EcoRV 
and ligated to pHASHSS (figure 15) opened with Pstl and blunted. The construct was 
named pHASHSS, see figure 18. 

20 

Example 18 

Increased solids and improved processing quality of potatoes 

25 In another aspect the invention may be used to increase the solids content of potato 
varieties that are used for processed potato products or as table potato varieties. The 
potato genotypes are transformed with gene constructs as described above for the 
over-expression of a gene coding for a starch biosynthesis enhancing protein. This 
starch biosynthesis enhancing protein may be derived from genes described above or 

30 other plant genes containing the same functional domains. 

Over-expression of the StGH1 and/or the StGH2 gene in potato plants results in an 
increase in solids as can be seen in table 7 and 8. 
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1 . A method of increasing the production of starch in plants comprising culturing a 
plant with enhanced expression or activity of at least one starch biosynthesis 

5 enhancing protein. 

2. The method of claim 1, wherein said starch has a high amylose content. 

3. The method as claimed in either claim 1 or 2, wherein production of amylose is 
10 increased. 



4. The method as claimed in any one of claims 1 to 3, wherein said method 
comprises over-expression of a starch biosynthesis enhancing protein. 

15 5. The method as claimed in claim 4, wherein said protein comprises the 

SEQ ID NO: 2 or 4 or a protein derived from this sequence by substitution, 
insertion or deletion of amino acids and which has at least 50% identity at the 
amino acid level with SEQ ID NO: 2 or 4. 



20 6. The method as claimed in any of claims 1 to 5, wherein the starch biosynthesis 
enhancing protein is encoded by a nucleic acid sequence selected from the 
group consisting of: 



a) a nucleic acid sequence comprising a nucleotide sequence which is at least 
25 60% identical to the nucleic acid sequence of SEQ ID NO: 1 or 3; 

b) a nucleic acid sequence comprising a fragment of at least 30 nucleotides of 
a nucleic acid sequence comprising the nucleotide sequence of SEQ ID 
NO:1 or 3; 



30 



c) a nucleic acid sequence which encodes a polypeptide comprising an amino 
acid sequence at least about 60% identical to the amino acid sequence of 
SEQ ID NO:2 or 4 and 



35 d) a nucleic acid sequence which encodes a fragment of a polypeptide 

comprising the amino acid sequence of SEQ ID NO: 2 or 4 or wherein the 
fragment comprises at least 10 contiguous amino acid residues of the 
amino acid sequence of SEQ ID NO:2 or 4, 
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7. The method as claimed in any one of claims 1 to 6, wherein the starch 

biosynthesis enhancing protein is encoded by a nucleic acid sequence 
comprising the nucleotide sequence set forth as SEQ ID NO:1 or SEQ ID NO:3. 

8. The method as claimed in any one of claims 1 to 7, wherein deficiency or 
decreased activity is achieved by a method selected from the group consisting 
of: 



10 



a) knock-out of the gene encoding said protein; 

b) mutagenesis of the gene encoding said protein, wherein said mutation can 
be induced in the coding, non-coding, or regulatory regions of said gene; 

c) expression of an anti-sense RNA, wherein said anti-sense RNA is 
1 5 complementary to at least part of the RNA encoding said protein; 

9. A method of producing amylose type starch by culturing a plant which over- 
expresses SEQ ID NO:1 or 3 or has increased starch biosynthesis enhancing 
activity under conditions such that the plant produces an increased amount of 

20 amylose type starch. 

10. The method of any of the preceeding claims, wherein said plant belongs to the 
genus Solanum. 

25 11. The method of claim 1 0, wherein said plant is Solanum tuberosum. 

12. A nucleic acid sequence SEQ ID NO:1 encoding a starch biosynthesis 
enhancing protein. 

30 13. A nucleic acid sequence SEQ ID NO:3 encoding a starch biosynthesis 
enhancing protein. 



35 



14. An amino acid sequence SEQ ID NO:2 having starch biosynthesis enhancing 
activity. 

15. An amino acid sequence SEQ ID NO:4 having starch biosynthesis enhancing 
acitivity. 
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A transgenic expression cassette comprising in combination with a regulatory 
sequence a nucleic acid sequence selected from the group consisting of: 

a) a nucleic acid sequence comprising a nucleotide sequence which is at 
least 60% identical to the nucleotide sequence of SEQ ID NO:1 or 
SEQ ID NO:3, 

b) a nucleic acid sequence comprising a fragment of at least 30 nucleotides 
of a nucleic acid sequence comprising the nucleotide sequence of 
SEQ ID NO:1 or SEQ ID NO:3, 

c) a nucleic acid sequence which encodes a polypeptide comprising an 
amino acid sequence at least about 60% identical to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:4, or 

d) a nucleic acid sequence which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4 
wherein the fragment comprises at least 10 contiguous amino acid 
residues of the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4 

wherein said regulatory sequence is capable of mediating expression of said 
nucleic acid sequence in a plant. 

A transgenic expression cassette of claim 16, wherein said regulatory sequence 
is a promoter sequence heterologous with regard to said nucleic acid sequence. 

A transgenic expression cassette of claim 16, wherein said regulatory sequence 
is a tuber specific promoter sequence. 

A transgenic expression cassette of either claim 16, 17 or 18, wherein said 
nucleic acid sequence is arranged in antisense or sense orientation with regard 
to said promoter sequence. 

A transgenic expression cassette of any of the claims 16 to 19, wherein said 
nucleic acid sequence encodes a polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO:2 or SEQ ID NO:4. 

A transgenic expression cassette of any of the claims 16 to 20, wherein said 
nucleic acid molecule comprises the nucleotide sequence set forth in SEQ ID 
NO:1 or SEQ ID NO:3. 
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22. A transgenic expression cassette of any of the claims 1 6 to 21 , wherein said 
nucleic acid sequence encodes a naturally occurring variant of a polypeptide 
comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4. 

23. A transgenic host cell transformed with an expression cassette of any of the 
claims 16 to 22. 

24. A transgenic host cell of claim 23, wherein said host cell belongs to the genus 
Solanum. 

25. A transgenic plant comprising an expression cassette of any of claims 1 6 to 22. 

26. A transgenic potato plant comprising an expression cassette of any of claims 1 6 
to 22. 

27. A transgenic potato plant, plant part, seed or tuber comprising an expression 
cassette of any of claims 1 6 to 22. 
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SEQUENCE LISTING 

<110> BASF Plant Science GmbH 

5 <120> Enhanced Amylose Production in Plants 

<130> AE884-02 

<140> PF0000054331 
10 <141> 2003-03-07 

<160> 24 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 2084 

<212> DNA 

<213> Solanum tuberosum 

<220> 
<221> CDS 

<222> (302) . . (1696) 



15 



20 



35 



55 



6 



25 <400> 1 

ttttcataaa cttcttcaac tttattccat actcttttat ttatcagctc ctagatcttc 60 

ttttttgttt gttgatattc tcttgaaaat tgttcagtga agagttgatc aaagctaaga 12 0 

30 cacaggggct gcggccattt tttcaccgga atcttcttct ttattttccg gtgaaagttt 180 

gaacgcacac cgttatttct agacagtaga caatgtcaag tgaaaaacat cacaagtttt 240 

tgaagatttg taattaatta gttgagattt ttaatttgga ggaaagagaa aaacagagaa 300 



g atg ata ggg egg gtg ggc ttg ttg ttg gta ttg ttg ata gca acg acg 3 49 
Met lie Gly Arg Val Gly Leu Leu Leu Val Leu Leu lie Ala Thr Thr 
15 10 15 



40 gtg act att ggg get gaa acg acg acg tta aaa ggg gta aac aga aat 3 97 

Val Thr lie Gly Ala Glu Thr Thr Thr Leu Lys Gly Val Asn Arg Asn 
20 25 30 

gcg tat gcg act atg atg tat atg gga act ccg aga gac tac gag ttc 445 

45 Ala Tyr Ala Thr Met Met Tyr Met Gly Thr Pro Arg Asp Tyr Glu Phe 
35 40 45 

tac gtg gcg act cga gta atg etc cga tea ctt ace egg eta gga gtt 493 

Tyr Val Ala Thr Arg Val Met Leu Arg Ser Leu Thr Arg Leu Gly Val 

50 50 55 60 

gaa gee gat etc gtc gtt att get tea ctt gac gtt cct ctt cgc tgg 541 

Glu Ala Asp Leu Val Val lie Ala Ser Leu Asp Val Pro Leu Arg Trp 
65 70 75 80 



gtt caa act eta gaa cag gaa gat ggt get aag gtg gtg aga gtt aaa 589 
Val Gin Thr Leu Glu Gin Glu Asp Gly Ala Lys Val Val Arg Val Lys 
85 90 95 



60 aat ctg aac aat ccg tat tgt ate aac cct aat tgg aga ttc aag etc 637 
Asn Leu Asn Asn Pro Tyr Cys lie Asn Pro Asn Trp Arg Phe Lys Leu 
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100 105 110 

aca ctg aac aaa ctt tat gcg tgg age etc gta aat tat gac agg gtt 685 
Thr Leu Asn Lys Leu Tyr Ala Trp Ser Leu Val Asn Tyr Asp Arg Val 
115 120 125 

gtc atg ctt gat get gac aac ctt ttc etc cag aaa act gat gaa ctg 733 
Val Met Leu Asp Ala Asp Asn Leu Phe Leu Gin Lys Thr Asp Glu Leu 
130 135 140 

ttc caa tgt ggc cag ttt tgt get gtc ttc att aat ccc tgc ate ttc 781 
Phe. Gin Cys Gly Gin Phe Cys Ala Val Phe lie Asn Pro Cys lie Phe 
145 150 155 160 

cac act ggt etc ttt gta ttg cag cca tea aaa aag gtg ttc aat gac 829 
His Thr Gly Leu Phe Val Leu Gin Pro Ser Lys Lys Val Phe Asn Asp 
165 170 175 

atg ate cat gag ata gag att ggg agg gaa aat caa gac ggt gca gac 877 
Met lie His Glu lie Glu He Gly Arg Glu Asn Gin Asp Gly Ala Asp 
180 185 190 

caa ggt ttt att gga ggc cac ttc cca gat tta ctt gat egg cca atg 925 
Gin Gly Phe He Gly Gly His Phe Pro Asp Leu Leu Asp Arg Pro Met 
195 200 205 

ttc cac cct cct ctt aat ggt acc cag etc cag gga agt tac agg ctt 973 
Phe His Pro Pro Leu Asn Gly Thr Gin Leu Gin Gly Ser Tyr Arg Leu 
210 215 220 

cct eta gga tac caa atg gac gee tct tat tat tat etc aaa etc cat 1021 
Pro Leu Gly Tyr Gin Met Asp Ala Ser Tyr Tyr Tyr Leu Lys Leu His 
225 230 235 240 

tgg teg gta cct tgt gga cct aat agt gtc att aca ttt cct ggt get 1069 
Trp Ser Val Pro Cys Gly Pro Asn Ser Val He Thr Phe Pro Gly Ala 
245 250 255 

cca tgg tta aaa cca tgg tat tgg tgg tea tgg cct gtc tta ccc ttg 1117 
Pro Trp Leu Lys Pro Trp Tyr Trp Trp Ser Trp Pro Val Leu Pro Leu 
260 265 270 

ggc ate cag tgg cat gaa cag cga cgt eta act gtt ggg tat ggt get 1165 
Gly He Gin Trp His Glu Gin Arg Arg Leu Thr Val Gly Tyr Gly Ala 
275 280 285 

gag atg ata gca gtg ttg ate caa tct ata ttt tac eta gga ata att 1213 
Glu Met He Ala Val Leu He Gin Ser He Phe Tyr Leu Gly He He 
290 295 300 

gca gtg aca cgc eta gca cgc cca aat tta tea aag ttg tgc tat cgc 1261 
Ala Val Thr Arg Leu Ala Arg Pro Asn Leu Ser Lys Leu Cys Tyr Arg 
305 310 315 320 

cat gat gat age aag agt gee ttc tta eta cga act ggc ctt aaa ttg 13 09 
His Asp Asp Ser Lys Ser Ala Phe Leu Leu Arg Thr Gly Leu Lys Leu 
325 330 335 

att get ata tgg tec att ctt get gee tac aca gtt cct tat ttc gtg 13 57 
He Ala He Trp Ser He Leu Ala Ala Tyr Thr Val Pro Tyr Phe Val 
340 345 350 
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att cct tgt aca gtt cat cca eta gtt ggc tgg agt etc tac tta etc 1405 
He Pro Cys Thr Val His Pro Leu Val Gly Trp Ser Leu Tyr Leu Leu 
355 360 365 

5 

ggc tct ttt tea eta tec tgt ata aca gtg aat gca ttt ctt ttg ccg 1453 
Gly Ser Phe Ser Leu Ser Cys He Thr Val Asn Ala Phe Leu Leu Pro 
370 375 380 

10 atg eta cct gtt tta gtc cca tgg att ggg ate ctt ggg gee ctt ttg 1501 
Met Leu Pro Val Leu Val Pro Trp He Gly He Leu Gly Ala Leu Leu 
385 390 395 400 

gtg atg get tac cct tgg tac aac gac ggt gtt gta aga gca atg get 1549 
15 Val Met Ala Tyr Pro Trp Tyr Asn Asp Gly Val Val Arg Ala Met Ala 

405 410 415 

gta ttt aca tac gee ttc tgt get tct cca gca tta tgg atg gca ttg 1597 
Val Phe Thr Tyr Ala Phe Cys Ala Ser Pro Ala Leu Trp Met Ala Leu 
20 420 425 430 

gtt aaa ate aag tgt tct ctt cat gtt tea ctt gag agg gaa gga ttc 1645 

Val Lys He Lys Cys Ser Leu His Val Ser Leu Glu Arg Glu Gly Phe 
435 440 445 

25 

ttg ccc aag ata agt gaa tct aca gca cct get ggt tct aac aaa ctg 1693 

Leu Pro Lys He Ser Glu Ser Thr Ala Pro Ala Gly Ser Asn Lys Leu 
450 455 460 

30 tat tgaaagttga. aaagttaaag gaatcaacag gagaactaat gcttcagaaa 1746 
Tyr 
465 

catctccaaa cgttttgctt aggagacttg gagtctgett gtgetatect agctagttgc 1806 

35 

ttcagtctgt gctcttaatt age.atggaat tctgtgagtg ggtttagaat tgggaggatg 1866 
ttttgtgttg tacatggact atctctggtc tettgaatge tactccagga aaaagattgt 192 6 
40 ttctcactta attttttctg ttactaaatt gtatgtggaa taggttcttt aaaatttatt 1986 
catggattta tgttatgtat gctaacagtg taaatattaa gtcctggtga aataagtaat 2 046 
tccttattca tacaaaaaaa aaaaaaaaaa aaaaaaaa 2084 

45 

<210> 2 
<211> 465 
<212> PRT 
50 <213> Solanum tuberosum 

<400> 2 

Met He Gly Arg Val Gly Leu Leu Leu Val Leu Leu lie Ala Thr Thr 
15 10 15 

55 

Val Thr He Gly Ala Glu Thr Thr Thr Leu Lys Gly Val Asn Arg Asn 
20 25 30 



60 



Ala Tyr Ala Thr Met Met Tyr Met Gly Thr Pro Arg Asp Tyr Glu Phe 
35 40 45 



WO 2004/078983 PCT/EP2004/002096 

4 

Tyr Val Ala Thr Arg Val Met Leu Arg Ser Leu Thr Arg Leu Gly Val 
50 55 60 

Glu Ala Asp Leu Val Val lie Ala Ser Leu Asp Val Pro Leu Arg Trp 
5 65 70 75 80 

Val Gin Thr Leu Glu Gin Glu Asp Gly Ala Lys Val Val Arg Val Lys 
85 90 95 

10 Asn Leu Asn Asn Pro Tyr Cys lie Asn Pro Asn Trp Arg Phe Lys Leu 
100 105 110 



15 



30 



45 



60 



Thr Leu Asn Lys Leu Tyr Ala Trp Ser Leu Val Asn Tyr Asp Arg Val 

115 120 125 

Val Met Leu Asp Ala Asp Asn Leu Phe Leu Gin Lys Thr Asp Glu Leu 

130 135 140 



Phe Gin Cys Gly Gin Phe Cys Ala Val Phe lie Asn Pro Cys lie Phe 
20 145 150 155 160 

His Thr Gly Leu Phe Val Leu Gin Pro Ser Lys Lys Val Phe Asn Asp 
165 170 175 

25 Met lie His Glu lie Glu He Gly Arg Glu Asn Gin Asp Gly Ala Asp 
180 185 190 



Gin Gly Phe He Gly Gly His Phe Pro Asp Leu Leu Asp Arg Pro Met 
195 200 205 

Phe His Pro Pro Leu Asn Gly Thr Gin Leu Gin Gly Ser Tyr Arg Leu 
210 215 220 



Pro Leu Gly Tyr Gin Met Asp Ala Ser Tyr Tyr Tyr Leu Lys Leu His 
35 225 230 235 240 

Trp Ser Val Pro Cys Gly Pro Asn Ser Val He Thr Phe Pro Gly Ala 
245 250 255 

40 Pro Trp Leu Lys Pro Trp Tyr Trp Trp Ser Trp Pro Val Leu Pro Leu 
260 265 270 



Gly He Gin Trp His Glu Gin Arg Arg Leu Thr Val Gly Tyr Gly Ala 
275 280 285 

Glu Met He Ala Val Leu He Gin Ser He Phe Tyr Leu Gly He He 
290 295 300 



Ala Val Thr Arg Leu Ala Arg Pro Asn Leu Ser Lys Leu Cys Tyr Arg 
50 305 310 315 320 

His Asp Asp Ser Lys Ser Ala Phe Leu Leu Arg Thr Gly Leu Lys Leu 
325 330 335 

55 He Ala He Trp Ser He Leu Ala Ala Tyr Thr Val Pro Tyr Phe Val 
340 345 350 



He Pro Cys Thr Val His Pro Leu Val Gly Trp Ser Leu Tyr Leu Leu 
355 360 365 

Gly Ser Phe Ser Leu Ser Cys He Thr Val Asn Ala Phe Leu Leu Pro 
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370 

Met Leu Pro Val Leu 
385 

5 

Val Met Ala Tyr Pro 
405 

Val Phe Thr Tyr Ala 
10 420 

Val Lys He Lys Cys 
435 

15 Leu Pro Lys He Ser 
450 

Tyr 
465 

20 



5 

375 

Val Pro Trp He Gly 
390 

Trp Tyr Asn Asp Gly 
410 

Phe Cys Ala Ser Pro 
425 

Ser Leu His Val Ser 
440 

Glu Ser Thr Ala Pro 
455 



380 

He Leu Gly Ala Leu Leu 
395 400 

Val Val Arg Ala Met Ala 
415 

Ala Leu Trp Met Ala Leu 
430 

Leu Glu Arg Glu Gly Phe 
445 

Ala Gly Ser Asn Lys Leu 
460 



<210> 3 
<211> 2230 
25 <212> DNA 

<213> Solarium tuberosum 

<220> 
<221> CDS 
30 <222> (143) . . (2086) 

<400> 3 

tatccccaga gaatcagctg aatcaagaac tgatttttag attatgtttt cttgattctt 60 

35 tgaaatggga acttgatttt cagtttttca actcagatgt tgtgttcctt tagctggaaa 120 

acttgaaaaa ggaaagccca ga atg aga gga agt tta get ggt gga cca cct 172 

Met Arg Gly Ser Leu Ala Gly Gly Pro Pro 
15 10 

40 

agt cct att gaa cct aga cag agg ctt tct gta ttc act gag gaa aca 22 0 
Ser Pro He Glu Pro Arg Gin Arg Leu Ser Val Phe Thr Glu Glu Thr 
15 20 25 

45 age aaa aga agg ttc ttg aga agt aaa gtt ttc aga gat ggg gag aga 2 68 
Ser Lys Arg Arg Phe Leu Arg Ser Lys Val Phe Arg Asp Gly Glu Arg 
30 35 40 

get ctt cat agt ccc acc aaa aac agg aat ttt acc tgc aag ttc cca 316 
50 Ala Leu His Ser Pro Thr Lys Asn Arg Asn Phe Thr Cys Lys Phe Pro 
45 50 55 

act gtg aag ctt ata ttg ggt gtt att get ctg gtt gca att tgg tea 3 64 
Thr Val Lys Leu He Leu Gly Val He Ala Leu Val Ala He Trp Ser 
55 60 65 70 

etc tgg cat tct cca gca att tat aac acg gaa tac ata tct agt tea 412 

Leu Trp His Ser Pro Ala He Tyr Asn Thr Glu Tyr He Ser Ser Ser 
75 80 85 90 

60 

ggc tct egg get get ttg atg cac aga gag tta agt ggt cat tct tea 460 
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Gly Ser Arg Ala Ala Leu Met His Arg Glu Leu Ser Gly His Ser Ser 
95 100 105 



get gat caa 
5 Ala Asp Gin 



tec caa gtt 
Ser Gin Val 
10 125 

ggg ata tta 
Gly He Leu 
140 

15 

eta ccg gac 
Leu Pro Asp 
155 

20 aat ata aca 
Asn He Thr 



gaa ttt gag 
25 Glu Phe Glu 



ggt aaa cca 
Gly Lys Pro 
30 205 

tct aag gac 
Ser Lys Asp 
220 

35 

gca gca agg 
Ala Ala. Arg 
235 

40 ctt ctg gtt 
Leu Leu Val 



gag tta gtt 
45 Glu Leu Val 



aac act tta 
Asn Thr Leu 
50 2 85 

gca gtt cct 
Ala Val Pro 
300 

55 

cga gaa gec 
Arg Glu Ala 
315 

60 gga gee ata 
Gly Ala He 



cgt tat aca tea ctt 
Arg Tyr Thr Ser Leu 
110 

att gag aaa ctg gec 
He Glu Lys Leu Ala 
130 

aac ttc aat gac agt 
Asn Phe Asn Asp Ser 
145 

get gag cat gta ate 
Ala Glu His Val He 
160 

tgg gaa aca ata tat 
Trp Glu Thr He Tyr 
175 

gtc ccc act tgt cct 
Val Pro Thr Cys Pro 
190 

agg att gat etc ata 
Arg He Asp Leu He 
210 

tgg tat aga gat gta 
Trp Tyr Arg Asp Val 
225 

ctg get gec agt aat 
Leu Ala Ala Ser Asn 
240 

act gag cat ttc cca 
Thr Glu His Phe Pro 
255 

gta cgt gaa ggc aat 
Val Arg Glu Gly Asn 
270 

aga gag aag etc cac 
Arg Glu Lys Leu His 
290 

etc aag get aaa gca 
Leu Lys Ala Lys Ala 
305 

tat gca act att etc 
Tyr Ala Thr He Leu 
320 

get gca gca cag agt 
Ala Ala Ala Gin Ser 



tta gat att gac tgg 
Leu Asp He Asp Trp 
115 

gat agg cat gag tat 
Asp Arg His Glu Tyr 
135 

gaa att gat cag ttg 
Glu He Asp Gin Leu 
150 

ttg aac ctg gat cac 
Leu Asn Leu Asp His 
165 

cct gaa tgg ata gat 
Pro Glu Trp He Asp 
180 

tct ctg ccc aaa att 
Ser Leu Pro Lys He 
195 

gtt gta aag ctt cca 
Val Val Lys Leu Pro 
215 

get cgt ttt cac ttg 
Ala Arg Phe His Leu 
230 

aaa ggg tat cat cca 
Lys Gly Tyr His Pro 
245 

ace ccc aat ctg ttc 
Thr Pro Asn Leu Phe 
260 

gca tgg eta tat gaa 
Ala Trp Leu Tyr Glu 
275 

etc cct gtt ggg tea 
Leu Pro Val Gly Ser 
295 

aat tgg cac tct gga 
Asn Trp His Ser Gly 
310 

cac tea gca aat ttt 
His Ser Ala Asn Phe 
325 

att cgc ttg gca ggt 
He Arg Leu Ala Gly 



gac caa att 508 

Asp Gin He 

120 

cag ggc gta 556 
Gin Gly Val 



aag gag tta 604 
Lys Glu Leu 



gtc ccg aat 652 
Val Pro Asn 
170 

gaa gaa gaa 700 
Glu Glu Glu 
185 

cag ttt ccg 748 

Gin Phe Pro 

200 

tgc aag aag 796 
Cys Lys Lys 



cag ctg gca 844 
Gin Leu Ala 



ata cat gtg 892 
He His Val 
250 

ace tgt aaa 940 
Thr Cys Lys 
265 

cct aat ctg 988 

Pro Asn Leu 

280 

tgt gaa ctt 103 6 
Cys Glu Leu 



aat gta aga 1084 
Asn Val Arg 



tat gta tgt 1132 
Tyr Val Cys 
330 

tea acc cga 1180 
Ser Thr Arg 
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335 340 345 

gat ctt gtg ata ctt gtt gat gag act ate agt gac tac cac agg ggt 1228 
Asp Leu Val lie Leu Val Asp Glu Thr lie Ser Asp Tyr His Arg Gly 
5 350 355 . 360 

ggt tta gag get gec gga tgg aag ate cac acg ata aag aga ata agg 1276 

Gly Leu Glu Ala Ala Gly Trp Lys lie His Thr lie Lys Arg lie Arg 
365 370 375 

10 

aat cct aaa get gaa cag gat gec tac aat gag tgg aac tat age aaa 1324 

Asn Pro Lys Ala Glu Gin Asp Ala Tyr Asn Glu Trp Asn Tyr Ser Lys 
380 385 390 

15 ttt cgt etc tgg cag ctg aca gat tat gac aaa ate ate ttc att gat 1372 
Phe Arg Leu Trp Gin Leu Thr Asp Tyr Asp Lys lie lie Phe lie Asp 
395 400 405 410 

gcg gat ttg ttg ata ctg aga aat att gat ttt etc ttt gag atg cct 142 0 
20 Ala Asp Leu Leu lie Leu Arg Asn lie Asp Phe Leu Phe Glu Met Pro 

415 420 425 

gaa ata act gca ata gga aat aat gca ace ctt ttt aat tea ggc gtg 1468 
Glu lie Thr Ala lie Gly Asn Asn Ala Thr Leu Phe Asn Ser Gly Val 
25 430 435 440 

atg gtc gtt gaa cca tea aat tgc aca ttt cag ctg ttg atg gat cat 1516 
Met Val Val Glu Pro Ser Asn Cys Thr Phe Gin Leu Leu Met Asp His 
445 450 455 



30 

ate aat gag att gaa tea tac aat ggt ggg gat cag ggg tat ttg aac 1564 
He Asn Glu He Glu Ser Tyr Asn Gly Gly Asp Gin Gly Tyr Leu Asn 
460 465 470 

35 gaa ata ttc ace tgg tgg cat agg ate cca aaa cac atg aac ttt ttg 1612 
Glu He Phe Thr Trp Trp His Arg He Pro Lys His Met Asn Phe Leu 
475 480 485 490 

aaa cat tat tgg gaa ggt gat gag gag gag aag aag caa atg aaa aca 1660 
40 Lys His Tyr Trp Glu Gly Asp Glu Glu Glu Lys Lys Gin Met Lys Thr 

495 500 505 

cgt ctt ttt ggt get gat cct cca gtt etc tat gtt ctg cac tac tta 1708 
Arg Leu Phe Gly Ala Asp Pro Pro Val Leu Tyr Val Leu His Tyr Leu 
45 510 515 520 

ggc ctg aaa cct tgg tta tgc ttc agg gac tac gat tgc aac tgg aat 1756 

Gly Leu Lys Pro Trp Leu Cys Phe Arg Asp Tyr Asp Cys Asn Trp Asn 
525 530 535 

50 

gtg ggt aag ttg cag gag ttt gca agt gat gtg gca cac agg acg tgg 1804 

Val Gly Lys Leu Gin Glu Phe Ala Ser Asp Val Ala His Arg Thr Trp 
540 545 550 

55 tgg aag gtc cat gat gee atg cca gat aac tta cat aaa tat tgt ttg 1852 
Trp Lys Val His Asp Ala Met Pro Asp Asn Leu His Lys Tyr Cys Leu 
555 560 565 570 

ctt agg tct aaa cag aag get gca eta gag tgg gat cga aga gaa get 1900 
60 Leu Arg Ser Lys Gin Lys Ala Ala Leu Glu Trp Asp Arg Arg Glu Ala 

575 580 585 
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gag aaa get aac ttt tea gat ggt cat tgg aag ate aaa ata aag gac 1948 
Glu Lys Ala Asn Phe Ser Asp Gly His Trp Lys lie Lys lie Lys Asp 
590 595 600 

5 

cca cgt ttg gag act tgt tat gaa gaa ttt tgc ttc tgg gaa age atg 1996 
Pro Arg Leu Glu Thr Cys Tyr Glu Glu Phe Cys Phe Trp Glu Ser Met 
605 610 615 

10 tta tgg cac tgg ggt gaa aca aac tgg aca gat aat gee ace tct tea 2 044 
Leu Trp His Trp Gly Glu Thr . Asn Trp Thr Asp Asn Ala Thr Ser Ser 
620 625 630 

cca aca cct ccc atg gtc aat act get tea ctt tct tct ttg 2 086 

15 Pro Thr Pro Pro Met Val Asn Thr Ala Ser Leu Ser Ser Leu 
635 640 645 

taactaggag gttctgttac tatacctcgc tagtctgtaa gtaatacaga gtcacaactt 2146 
20 gaatgtcaac cagtgecatt agtatacatg gtgtcaacac tttccccacc cttgaaaaaa 2206 
aaaaaaaaaa aaaaaaaaaa aaaa 223 0 

25 <210> 4 

<211> 648 
<212> PRT 

<213> Solanum tuberosum 
30 <400> 4 

Met Arg Gly Ser Leu Ala Gly Gly Pro Pro Ser Pro lie Glu Pro Arg 
15 10 15 

Gin Arg Leu Ser Val Phe Thr Glu Glu Thr Ser Lys Arg Arg Phe Leu 
35 20 25 30 

Arg Ser Lys Val Phe Arg Asp Gly Glu Arg Ala Leu His Ser Pro Thr 
35 40 45 

40 Lys Asn Arg Asn Phe Thr Cys Lys Phe Pro Thr Val Lys Leu lie Leu 
50 55 60 



45 



Gly Val lie Ala Leu Val Ala lie Trp Ser Leu Trp His Ser Pro Ala 
65 70 75 80 

lie Tyr Asn Thr Glu Tyr lie Ser Ser Ser Gly Ser Arg Ala Ala Leu 
85 90 95 



Met His Arg Glu Leu Ser Gly His Ser Ser Ala Asp Gin Arg Tyr Thr 
50 100 105 110 

Ser Leu Leu Asp lie Asp Trp Asp Gin lie Ser Gin Val He Glu Lys 
115 120 125 

55 Leu Ala Asp Arg His Glu Tyr Gin Gly Val Gly He Leu Asn Phe Asn 
130 135 140 

Asp Ser Glu He Asp Gin Leu Lys Glu Leu Leu Pro Asp Ala Glu His 

145 150 155 160 

60 

Val He Leu Asn Leu Asp His Val Pro Asn Asn He Thr Trp Glu Thr 
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165 170 175 

lie Tyr Pro Glu Trp He Asp Glu Glu Glu Glu Phe Glu Val Pro Thr 
180 185 190 

5 

Cys Pro Ser Leu Pro Lys He Gin Phe Pro Gly Lys Pro Arg He Asp 
195 200 205 

Leu He Val Val Lys Leu Pro Cys Lys Lys Ser Lys Asp Trp Tyr Arg 
10 210 215 220 

Asp Val Ala Arg Phe His Leu Gin Leu Ala Ala Ala Arg Leu Ala Ala 
225 230 235 240 

15 Ser Asn Lys Gly Tyr His Pro He His Val Leu Leu Val Thr Glu His 

245 250 255 



20 



35 



50 



Phe Pro Thr Pro Asn Leu Phe Thr Cys Lys Glu Leu Val Val Arg Glu 
260 265 270 

Gly Asn Ala Trp Leu Tyr Glu Pro Asn Leu Asn Thr Leu Arg Glu Lys 
275 280 285 



Leu His Leu Pro Val Gly Ser Cys Glu Leu Ala Val Pro Leu Lys Ala 
25 290 295 300 

Lys Ala Asn Trp His Ser Gly Asn Val Arg Arg Glu Ala Tyr Ala Thr 
305 310 315 320 

30 He Leu His Ser Ala Asn Phe Tyr Val Cys Gly Ala He Ala Ala Ala 

325 330 335 



Gin Ser He Arg Leu Ala Gly Ser Thr Arg Asp Leu Val He Leu Val 

340 345 350 

Asp Glu Thr He Ser Asp Tyr His Arg Gly Gly Leu Glu Ala Ala Gly 
355 360 365 



Trp Lys He His Thr He Lys Arg He Arg Asn Pro Lys Ala Glu Gin 
40 370 375 380 

Asp Ala Tyr Asn Glu Trp Asn Tyr Ser Lys Phe Arg Leu Trp Gin Leu 
385 390 395 400 

45 Thr Asp Tyr Asp Lys He He Phe He Asp Ala Asp Leu Leu He Leu 

405 410 415 



Arg Asn He Asp Phe Leu Phe Glu Met Pro Glu lie. Thr Ala He Gly 

420 425 430 

Asn Asn Ala Thr Leu Phe Asn Ser Gly Val Met Val Val Glu Pro Ser 

435 440 445 



Asn Cys Thr Phe Gin Leu Leu Met Asp His He Asn Glu He Glu Ser 
55 450 - 455 - 460 

Tyr Asn Gly Gly Asp Gin Gly Tyr Leu Asn Glu He Phe Thr Trp Trp 
465 470 475 480 

60 His Arg He Pro Lys His Met Asn Phe Leu Lys His Tyr Trp Glu Gly 

485 490 495 
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25 



Asp Glu Glu Glu Lys Lys Gin Met Lys Thr Arg Leu Phe Gly Ala Asp 
500 505 510 

Pro Pro Val Leu Tyr Val Leu His Tyr Leu Gly Leu Lys Pro Trp Leu 
515 520 525 

Cys Phe Arg Asp Tyr Asp Cys Asn Trp Asn Val Gly Lys Leu Gin Glu 
530 535 540 

Phe Ala Ser Asp Val Ala His Arg Thr Trp Trp Lys Val His Asp Ala 
545 550 555 560 



Met Pro Asp Asn Leu His Lys Tyr Cys Leu Leu Arg Ser Lys Gin Lys 

15 565 570 575 

Ala Ala Leu Glu Trp Asp Arg Arg Glu Ala Glu Lys Ala Asn Phe Ser 
580 585 590 

20 Asp Gly His Trp Lys lie Lys He Lys Asp Pro Arg Leu Glu Thr Cys 
595 600 605 



Tyr Glu Glu Phe Cys Phe Trp Glu Ser Met Leu Trp His Trp Gly Glu 
610 615 620 

Thr Asn Trp Thr Asp Asn Ala Thr Ser Ser Pro Thr Pro Pro Met Val 

625 630 635 640 



Asn Thr Ala Ser Leu Ser Ser Leu 
30 645 





<210> 5 




























35 


<211> 1485 






























<212> DNA 






























<213> Arabidopsis thaliana 






















<220> 




























40 


<221> CDS 






























<222> (1) . . 


(1485) 


























<400> 5 




























45 


atg gat tta 


caa 


aga 


act 


ttg 


atg 


ttc 


tct 


tgt 


tgg 


gtt 


ctg 


tct 


ctt 


Met Asp Leu 


Gin 


Arg 


Thr 


Leu 


Met 


Phe 


Ser 


Cys 


Trp 


Val 


Leu 


Sei- 


Leu 




1 




5 










10 










15 






ttg ate ate 


aaa 


acg 


aca 


gcg 


tat 


aac 


gag 


aaa 


cag 


ctg 


ttc 


cag 


ccg 




Leu He lie 


Lys 


Thr 


Thr 


Ala 


Tyr 


Asn 


Glu 


Lys 


Gin 


Leu 


Phe 


Gin 


Pro 


50 




20 










25 










30 








ctt gaa acg 


gaa 


aac 


gca 


aac 


gcg 


atg 


acc 


gcg 


gtt 


atg 


gag 


cga 


gga 




Leu Glu Thr 


Glu 


Asn 


Ala 


Asn 


Ala 


Met 


Thr 


Ala 


Val 


Met 


Glu 


Arg 


Gly 




35 










40 










45 








55 
































tta aag acg 


cag 


egg 


egg 


ccg 


gag 


cac 


aag 


aac 


get 


tat 


gcg 


acg 


atg 




Leu Lys Thr 


Gin 


Arg 


Arg 


Pro 


Glu 


His 


Lys 


Asn 


Ala 


Tyr Ala 


Thr 


Met 




50 








55 










60 











48 



96 



144 



192 



60 atg tac atg gga aca cca aga gac tac gag ttc tac gtt gcg aca cgt 240 
Met Tyr Met Gly Thr Pro Arg Asp Tyr Glu Phe Tyr Val Ala Thr Arg 



10 
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65 70 75 80 

gtc ttg ate aga teg ctt aag agt etc cac gtg gac get gat ate gtc 2 88 

Val Leu He Arg Ser Leu Lys Ser Leu His Val Asp Ala Asp He Val 

85 90 95 

gtt ata gee tec etc gac gtt cct ate aac tgg att cac get ctg gaa 33 6 

Val He Ala Ser Leu Asp Val Pro He Asn Trp He His Ala Leu Glu 

100 105 110 

gaa gaa gat gga get aaa gta gtg aga gta gag aat ctt gag aat cca 3 84 

Glu Glu Asp Gly Ala Lys Val Val Arg Val Glu Asn Leu Glu Asn Pro 
115 120 125 



15 tac aag aaa caa acc aac ttc gac aac aga ttc aag ctt agt eta aac 432 
Tyr Lys Lys Gin Thr Asn Phe Asp Asn Arg Phe Lys Leu Ser Leu Asn 

130 135 140 

aag etc tac get tgg tct etc tct gat tat gac cgt gtt gta atg ctt 480 
20 Lys Leu Tyr Ala Trp Ser Leu Ser Asp Tyr Asp Arg Val Val Met Leu 

145 150 155 160 

gat gtc gac aat etc ttt etc aag aac acc gac gag etc ttc cag tgt 528 

Asp Val Asp Asn Leu Phe Leu Lys Asn Thr Asp Glu Leu Phe Gin Cys 

25 165 170 175 

ggc caa ttt tgt get gtc ttc ate aac cct tgc ate ttc cac act ggt 576 

Gly Gin Phe Cys Ala Val Phe He Asn Pro Cys He Phe His Thr Gly 

180 185 190 

30 

etc ttt gtg ttg cag cca tea atg gag gtc ttt aga gac atg ctt cat 624 

Leu Phe Val Leu Gin Pro Ser Met Glu Val Phe Arg Asp Met Leu His 

195 200 205 

35 gag ctt gaa gta aag aga gat aac cct gat gga get gat caa ggc ttt 672 

Glu Leu Glu Val Lys Arg Asp Asn Pro Asp Gly Ala Asp Gin Gly Phe 

210 215 220 

ctt gtc age tac ttc tct gat tta etc aat cag cct etc ttt cgt cct 72 0 

40 Leu Val Ser Tyr Phe Ser Asp Leu Leu Asn Gin Pro Leu Phe Arg Pro 

225 230 235 240 

cct ccc gat aac cgc acc gcg ctt aag gga cat ttt agg ctt cct ttg 768 

Pro Pro Asp Asn Arg Thr Ala Leu Lys Gly His Phe Arg Leu Pro Leu 

45 245 250 255 

gga tat caa atg gac gca tct tat tac tac ctt aag etc aga tgg aac 816 

Gly Tyr Gin Met Asp Ala Ser Tyr Tyr Tyr Leu Lys Leu Arg Trp Asn 

260 265 270 



50 



gta cca tgt gga cca aac agt gtg ata acg ttc cca gga gca gta tgg 864 
Val Pro Cys Gly Pro Asn Ser Val He Thr Phe Pro Gly Ala Val Trp 
275 280 285 



55 tta aag cca" tgg tat tgg tgg tea ~tgg~ cct gtt ctt cct" tta ggc ctt 912 
Leu Lys Pro Trp Tyr Trp Trp Ser Trp Pro Val Leu Pro Leu Gly Leu 
290 295 300 



60 



tea tgg cac cac caa cgc cgc tac acg att agt tat tea gca gag atg 
Ser Trp His His Gin Arg Arg Tyr Thr He Ser Tyr Ser Ala Glu Met 
305 310 315 320 



960 
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cct tgg gtc eta acc caa gca gtg ttc tac eta gga att ata eta gtc 1008 
Pro Trp Val Leu Thr Gin Ala Val Phe Tyr Leu Gly lie lie Leu Val 
325 330 335 

5 

aca cgt eta gcg cgt ccc aac atg acc aag eta tgt tac cga cgt tct 1056 
Thr Arg Leu Ala Arg Pro Asn Met Thr Lys Leu Cys Tyr Arg Arg Ser 
340 345 350 

10 gat aag aat eta age atg ate cag aca get ttc aag ttt gtt gca etc 1104 
Asp Lys Asn Leu Ser Met lie Gin Thr Ala Phe Lys Phe Val Ala Leu 
355 360 365 

etc ttt ate etc tea gec tac att ata cca ttc ttc ate ate cca cag 1152 
15 Leu Phe lie Leu Ser Ala Tyr lie lie Pro Phe Phe lie lie Pro Gin 
370 375 380 

acg ate cac ccg etc att ggt tgg tct etc tac tta acc ggc tec ttt 12 00 
Thr lie His Pro Leu lie Gly Trp Ser Leu Tyr Leu Thr Gly Ser Phe 
20 385 390 395 400 

get etc tct acc ata ccc ate aac gee ttc ttg ctt ccc att etc cct 1248 
Ala Leu Ser Thr lie Pro lie Asn Ala Phe Leu Leu Pro lie Leu Pro 
405 410 415 



25 



gtc ata aca ccg tgg ctt ggc att ttc ggg aca etc etc gtg atg get 12 96 
Val lie Thr Pro Trp Leu Gly lie Phe Gly Thr Leu Leu Val Met Ala 
420 425 430 

30 ttt cct tct tat cct gat ggc gtt gtt aga gca ttg teg gtt ttc ggg 13 44 
Phe Pro Ser Tyr Pro Asp Gly Val Val Arg Ala Leu Ser Val Phe Gly 
435 440 445 

tat gca ttt tgt tgt gca ccg ttt eta tgg gtc tec ttt gtg aag ate 13 92 
35 Tyr Ala Phe Cys Cys Ala Pro Phe Leu Trp Val Ser Phe Val Lys lie 
450 455 460 

aca teg cat ctt cag att atg att gac aaa gag gtt ttg ttt ccg egg 1440 
Thr Ser His Leu Gin He Met He Asp Lys Glu Val Leu Phe Pro Arg 
40 465 470 475 480 

ttg ggt gaa tec gga gtc act tct ggt etc age aaa ttg tac tga 1485 
Leu Gly Glu Ser Gly Val Thr Ser Gly Leu Ser Lys Leu Tyr 

485 490 495 

45 

<210> 6 
<211> 494 
<212> PRT 
50 <213> Arabidopsis thaliana 

<400> 6 

Met Asp Leu Gin Arg Thr Leu Met Phe Ser Cys Trp Val Leu Ser Leu 

1 5 10 15 

55 Leu He lie Lys Thr Thr Ala Tyr Asn Glu Lys Gin Leu Phe Gin Pro 

20 25 30 

Leu Glu Thr Glu Asn Ala Asn Ala Met Thr Ala Val Met Glu Arg Gly 

35 40 45 

Leu Lys Thr Gin Arg Arg Pro Glu His Lys Asn Ala Tyr Ala Thr Met 

60 50 55 60 

Met Tyr Met Gly Thr Pro Arg Asp Tyr Glu Phe Tyr Val Ala Thr Arg 
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65 










70 










•7 C 














Val 


Leu 


He 


Arg 


Ser 
85 


Leu 


Lys 


ber 


Leu 


His 
90 


vai 


Asp 


AT a 
nla 




lie 


vai 




Val 


He 


Ala 


Ser 


Leu 


Asp 


Val 


Pro 


He 


Asn 


Trp 


He 


His 


Ala 


Leu 


Glu 


5 








100 










105 










11 U 








Glu 


Glu 


Asp 


Gly Ala 


Lys 


Val 


Val 


Arg Val 


CjJLU 


Asn 


Leu 




Asn 










115 










120 










-IOC 

IzS D 










Tyr 


Lys 


Lys 


Gin 


Thr 


Asn 


Phe 


Asp 


Asn 


Arg 


Pne 


Lys 


Leu 


Ser 


Leu 


Asn 




130 










135 










1 A ft 

140 










10 


Lys 
145 


Leu 


Tyr 


Ala 


Trp 


Ser 
150 


Leu 


Ser 


Asp 


Tyr 


Asp 
155 


Arg 


Val 


Val 


Met 


Leu 
lbU 




Asp 


Val 


Asp 


Asn 


Leu 


Phe 


Leu 


Lys 


Asn 


Thr 


Asp 


Glu 


Leu 


Phe 


Gin 


Cys 










165 










170 










I/O 






Gly 


Gin 


Phe 


Cys 


Ala 


Val 


Phe 


He 


Asn 


Pro 


Cys 


He 


Phe 


His 


Thr 


Gly 


15 






180 










185 










±y u 








Leu 


Phe 


Val 
195 


Leu 


Gin 


Pro 


Ser 


Met 
200 


Glu 


Val 


Phe 


Arg 


Asp 

o A r 

2 05 


Met 


Leu 


His 




Glu 


Leu 
210 


Glu 


Val 


Lys 


Arg 


Asp 
215 


Asn 


Pro 


Asp 


Gly 


ax a 


Asp 








20 


Leu 
225 


Val 


Ser 


Tyr 


Phe 


Ser 
230 


Asp 


Leu 


Leu 


Asn 


Gin 
235 


Pro 


Leu 


Phe 


Arg 


Pro 

-6 4 U 




Pro 


Pro 


Asp 


Asn Arg 


Thr 


Ala 


Leu 


Lys 


Gly 


His 


Phe 


Arg 


Leu 


Pro 


Leu 












245 










250 










ADZ> 






Gly 


Tyr 


Gin 


Met 


Asp 


Ala 


Ser 


Tyr 


Tyr 


Tyr 


Leu 


Lys 


Leu 


Arg 


Trp 


Asn 


25 




260 










265 










OTA 

Z 1 u 








Val 


Pro 


Cys 
275 


Gly 
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<210> 7 

<211> 1494 

<212> DNA 

60 <213> Arabidopsis thaliana 
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<220> 

<221> CDS 

<222> (1) . . (1494) 

5 <400> 7 

atg gat ttg cag aga ggc ttt gtg ttc ttg tct ttg gtt eta tct ttt 48 

Met Asp Leu Gin Arg Gly Phe Val Phe Leu Ser Leu Val Leu Ser Phe 

15 10 15 

10 atg ata ate gaa acg acg gcg tat cga gag aga cag ctg ctg ctg ctg 96 
Met lie lie Glu Thr Thr Ala Tyr Arg Glu Arg Gin Leu Leu Leu Leu 
20 25 30 

caa cca ccg caa gaa acg gcg ata gat acc gca aac gcg gtg gtg acg 144 
15 Gin Pro Pro Gin Glu Thr Ala He Asp Thr Ala Asn Ala Val Val Thr 
35 40 45 



gtt caa gat cga ggt ttg aag acg egg cga ccg gag cat aag aac gca 192 
Val Gin Asp Arg Gly Leu Lys Thr Arg Arg Pro Glu His Lys Asn Ala 
20 50 55 60 



25 



tac gca acg atg atg tac atg ggg acg cca aga gac tac gag ttc tac 240 
Tyr Ala Thr Met Met Tyr Met Gly Thr Pro Arg Asp Tyr Glu Phe Tyr 
65 70 75 80 

gtt gcg aca cgt gtt ttg ate aga teg ttg aga agt ctt cac gtg gaa 288 
Val Ala Thr Arg Val Leu He Arg Ser Leu Arg Ser Leu His Val Glu 

85 90 95 

30 get gat etc gtc gtc ate get tct etc gac gtt cct etc cga tgg gtt 336 
Ala Asp Leu Val Val He Ala Ser Leu Asp Val Pro Leu Arg Trp Val 
100 105 110 

caa acc ttg gaa gag gaa gat gga get aaa gtg gtg aga gtt gaa aat 3 84 
35 Gin Thr Leu Glu Glu Glu Asp Gly Ala Lys Val Val Arg Val Glu Asn 
115 120 125 

gtg gat aat cca tac agg aga cag acc aac ttc aac agt aga ttc aag 432 
Val Asp Asn Pro Tyr Arg Arg Gin Thr Asn Phe Asn Ser Arg Phe Lys 
40 130 135 140 

ctt act eta aac aag etc tac get tgg get ttg tct gat tac gac cgt 480 
Leu Thr Leu Asn Lys Leu Tyr Ala Trp Ala Leu Ser Asp Tyr Asp Arg 
145 150 155 160 



45 



gtg gtc atg eta gat gec gat aac etc ttt ctt aag aaa gee gac gag 52 8 
Val Val Met Leu Asp Ala Asp Asn Leu Phe Leu Lys Lys Ala Asp Glu 
165 170 175 



50 ttg ttc cag tgt ggg cgc ttc tgt gcg gtc ttc att aac cct tgt ate 576 
Leu Phe Gin Cys Gly Arg Phe Cys Ala Val Phe He Asn Pro Cys He 
180 185 190 

ttc cac act ggt etc ttc gtg ttg cag cca tea gtg gaa gtg ttc aag 624 
55 Phe His Thr Gly Leu Phe Val Leu Gin Pro Ser Val Glu Val Phe Lys 
195 200 205 



60 



gac atg etc cat gag eta caa gtt gga aga aag aat cct gat gga get 
Asp Met Leu His Glu Leu Gin Val Gly Arg Lys Asn Pro Asp Gly Ala 
210 215 220 



672 
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gat caa ggt ttt ctt gtc agt tac ttc tct gat ctt ctt gac caa cct 72 0 

Asp Gin Gly Phe Leu Val Ser Tyr Phe Ser Asp Leu Leu Asp Gin Pro 

225 230 235 240 

5 etc ttt agt cct ccg agt aac gga tct gta ctt aat ggt cac ttg aga 768 

Leu Phe Ser Pro Pro Ser Asn Gly Ser Val Leu Asn Gly His Leu Arg 

245 250 255 

ctt ccc tta ggc tac caa atg gac get tct tat ttc tat ctt aag eta 816 

10 Leu Pro Leu Gly Tyr Gin Met Asp Ala Ser Tyr Phe Tyr Leu Lys Leu 

260 265 270 

aga tgg aac ata ccc tgt gga cca aac agt gtg att aca ttc ccg gga 864 

Arg Trp Asn He Pro Cys Gly Pro Asn Ser Val He Thr Phe Pro Gly 
15 275 280 285 

get gtt tgg tta aag cca tgg tac tgg tgg tea tgg cct gtt ctt cca 912 

Ala Val Trp Leu Lys Pro Trp Tyr Trp Trp Ser Trp Pro Val Leu Pro 
290 295 300 



20 



eta ggt ttc tea tgg cac gag cag cgt cgc gee act ata ggg tac tea 960 
Leu Gly Phe Ser Trp His Glu Gin Arg Arg Ala Thr He Gly Tyr Ser 
305 310 315 320 

25 gee gaa atg cct ttg gtt ata ate caa gca atg ttt tac ctt gga ate 1008 

Ala Glu Met Pro Leu Val lie He Gin Ala Met Phe Tyr Leu Gly He 
325 330 335 

ata gtg gtt aca cga eta get cgt cct aac ata ace aag eta tgt tat 1056 

30 He Val Val Thr Arg Leu Ala Arg Pro Asn He Thr Lys Leu Cys Tyr 
340 345 350 

cgc cgc tct gac cgc aac tta aca acg ate caa get ggt ttt aag ttg 1104 

Arg Arg Ser Asp Arg Asn Leu Thr Thr He Gin Ala Gly Phe Lys Leu 
35 355 360 365 

ate gcg ctt etc tct gta gtt gca gee tac ate ttc cea ttc ttc ace 1152 

He Ala Leu Leu Ser Val Val Ala Ala Tyr He Phe Pro Phe Phe Thr 

370 375 380 

40 

ate cct cac act ate cac cca etc ate ggc tgg teg etc tac ttg atg 1200 

He Pro His Thr He His Pro Leu He Gly Trp Ser Leu Tyr Leu Met 
385 390 395 400 

45 get tct ttt get etc tct tec att tea ate aac act etc etc etc cca 1248 

Ala Ser Phe Ala Leu Ser Ser He Ser He Asn Thr Leu Leu Leu Pro 
405 410 415 

acg etc cct gtt etc act cca tgg eta gga att etc ggc act etc ctt 1296 

50 Thr Leu Pro Val Leu Thr Pro Trp Leu Gly He Leu Gly Thr Leu Leu 
420 425 430 

gtc atg gec ttc cct tgg tac cct gat gga gtg gtc aga gee ttg tea 1344 

Val Met Ala Phe Pro Trp Tyr Pro Asp Gly Val Val Arg Ala Leu Ser 
55 435 440 445 

gtt ttc gca tac gca ttt tgt tgc gca ccc ttt gtg tgg gtt tea ttc 13 92 

Val Phe Ala Tyr Ala Phe Cys Cys Ala Pro Phe Val Trp Val Ser Phe 

450 455 460 



60 



cgc aaa ate aca teg cac etc cag gtt ttg att gag aaa gag gtg ttg 1440 



1 » V ■ • 
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Arg Lys lie Thr Ser His Leu Gin 
465 470 

ttc ccg cga ttg gga gac tea ggg 
5 Phe Pro Arg Leu Gly Asp Ser Gly 

485 

tat tag 
Tyr 

10 

<210> 8 
<211> 497 
<212> PRT 
15 <213> Arabidopsis thaliana 
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<210> 9 
<211> 1944 
25 <212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
30 <222> (1) . . (1944) 

<400> 9 

atg cat cca get tea tgt agt etc tea etc agg aaa gtc aaa ctt aat 48 

Met His Pro Ala Ser Cys Ser Leu Ser Leu Arg Lys Val Lys Leu Asn 
35 1 5 10 15 

att ctt aca gtg aga atg aag etc tct tct gaa agt ccc atg gcg ccg 96 

He Leu Thr Val Arg Met Lys Leu Ser Ser Glu Ser Pro Met Ala Pro 

20 25 30 

40 

tea agt cag tea agt cat cga ctt tac att tec agt gag aaa aca aaa 144 

Ser Ser Gin Ser Ser His Arg Leu Tyr He Ser Ser Glu Lys Thr Lys 
35 40 45 

45 acg aag aga ttc caa aga aac gga tac act etc gat gtt gaa atg tgt 192 
Thr Lys Arg Phe Gin Arg Asn Gly Tyr Thr Leu Asp Val Glu Met Cys 
50 55 60 

gtc aac ttc tct agt ctg aaa ctt gtt ttg ttt ctt atg atg ctg gtt 240 
50 Val Asn Phe Ser Ser Leu Lys Leu Val Leu Phe Leu Met Met Leu Val 
65 70 75 80 

get atg ttc aca etc tac tgt tct cca ccg ttg caa att cct gaa gat 288 
Ala Met Phe Thr Leu Tyr Cys Ser Pro Pro Leu Gin He Pro Glu Asp 
55 85 90 95 

cca tea agt ttt gca aac aaa tgg ata eta gaa cct get gta acc aca 33 6 
Pro Ser Ser Phe Ala Asn Lys Trp He Leu Glu Pro Ala Val Thr Thr 
100 105 110 



60 



gat cct cgc tat ata get aca tct gag ate aac tgg aac agt atg tea 384 
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Asp Pro Arg Tyr lie Ala Thr Ser Glu lie Asn Trp Asn Ser Met Ser 

115 120 125 

ctt gtt gtt gag cat tac tta tct ggc aga age gag tat caa gga att 432 

5 Leu Val Val Glu His Tyr Leu Ser Gly Arg Ser Glu Tyr Gin Gly lie 
130 135 140 

ggc ttt eta aat etc aac gat aac gag att aat cga tgg cag gtg gtc 480 

Gly Phe Leu Asn Leu Asn Asp Asn Glu lie Asn Arg Trp Gin Val Val 
10 145 150 155 160 

ata aaa tct cac tgt cag cat ata get ttg cat eta gac cat get gca 52 8 

He Lys Ser His Cys Gin His He Ala Leu His Leu Asp His Ala Ala 
165 170 175 



15 



55 



agt aac ata act tgg aaa tct tta tac ccg gaa tgg att gac gag gaa 576 
Ser Asn He Thr Trp Lys Ser Leu Tyr Pro Glu Trp He Asp Glu Glu 
180 185 190 



20 gaa aaa ttc aaa gtc ccc act tgt cct tct ctt cct tgg att caa gtt 624 
Glu Lys Phe Lys Val Pro Thr Cys Pro Ser Leu Pro Trp He Gin Val 
195 200 205 



cct gac aag tct cga ate gat ctt ate att gec aag etc cca tgt aac 
25 Pro Asp Lys Ser Arg He Asp Leu He He Ala Lys Leu Pro Cys Asn 
210 215 220 



672 



aag tea gga aaa tgg tea aga gat gtg get aga ttg cac tta caa ctt 720 
Lys Ser Gly Lys Trp Ser Arg Asp Val Ala Arg Leu His Leu Gin Leu 
30 225 230 235 240 

gca gca get cga gtg gcg gca tct tct gaa ggg ctt cat gat gtt cat 7 68 

Ala Ala Ala Arg Val Ala Ala Ser Ser Glu Gly Leu His Asp Val His 
245 250 255 

35 

gtg att ttg gta tea gat tgc ttt cca ata ccg aat ctt ttt acg ggt 816 

Val He Leu Val Ser Asp Cys Phe Pro He Pro Asn Leu Phe Thr Gly 
260 265 270 

40 caa gaa ctt gtt gee cgt caa gga aac ata tgg ctg tat aag cct aaa 864 
Gin Glu Leu Val Ala Arg Gin Gly Asn He Trp Leu Tyr Lys Pro Lys 
275 280 285 

ctt cac cag tta aga caa aag tta caa ctt cct gtt ggt tec tgt gaa 912 
45 Leu His Gin Leu Arg Gin Lys Leu Gin Leu Pro Val Gly Ser Cys Glu 
290 295 300 

ctt tct gtt cct ctt caa get aaa gat aat ttc tac teg gca aat gee 960 
Leu Ser Val Pro Leu Gin Ala Lys Asp Asn Phe Tyr Ser Ala Asn Ala 
50 305 310 315 320 

aag aaa gaa gcg tac gcg acg ate ttg cac tea gat gat get ttt gtc 1008 
Lys Lys Glu Ala Tyr Ala Thr He Leu His Ser Asp Asp Ala Phe Val 
325 330 335 



tgt gga gee att gca gta gca cag age att cga atg tea ggc tct act 1056 
Cys Gly Ala He Ala Val Ala Gin Ser He Arg Met Ser Gly Ser Thr 
340 345 350 



60 cgc aat ttg gta ata eta gtc gat gat teg ate agt gaa tac cat aga 1104 
Arg Asn Leu Val He Leu Val Asp Asp Ser He Ser Glu Tyr His Arg 
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355 360 365 

agt ggc ttg gaa tea get gga tgg aag att cac aca ttt caa aga ate 1152 

Ser Gly Leu Glu Ser Ala Gly Trp Lys lie His Thr Phe Gin Arg lie 
5 370 375 380 

aga aac ccg aaa get gaa gca aat gca tat aac caa tgg aac tac age 1200 

Arg Asn Pro Lys Ala Glu Ala Asn Ala Tyr Asn Gin Trp Asn Tyr Ser 
385 390 395 400 



10 



30 



50 



aaa ttc cgt ctt tgg gaa ttg aca gaa tac aac aag ate ate ttc att 1248 
Lys Phe Arg Leu Trp Glu Leu Thr Glu Tyr Asn Lys lie He Phe He 
405 410 415 



15 gat gca gac atg ctt ate etc aga aac atg gat ttc etc ttc gag tac 1296 
Asp Ala Asp Met Leu He Leu Arg Asn Met Asp Phe Leu Phe Glu Tyr 
420 425 430 

ccc gaa ate tec aca act gga aac gac ggt acg etc ttc aac tec ggt 1344 
20 Pro Glu He Ser Thr Thr Gly Asn Asp Gly Thr Leu Phe Asn Ser Gly 
435 440 445 

eta atg gtg att gaa cca tea aat tea aca ttc cag tta eta atg gat 1392 
Leu Met Val He Glu Pro Ser Asn Ser Thr Phe Gin Leu Leu Met Asp 
25 450 455 460 

cac ate aac gat ate aat tec tac aat gga gga gac caa ggt tac ctt 1440 
His He Asn Asp He Asn Ser Tyr Asn Gly Gly Asp Gin Gly Tyr Leu 
465 470 475 480 



aac gag ata ttc aca tgg tgg cat egg att cca aaa cac atg aat ttc 1488 
Asn Glu He Phe Thr Trp Trp His Arg He Pro Lys His Met Asn Phe 
485 490 495 



35 ttg aag cat ttc tgg gaa gga gac aca cct aag cac agg aaa tct aag 153 6 

Leu Lys His Phe Trp Glu Gly Asp Thr Pro Lys His Arg Lys Ser Lys 

500 ~ 505 510 

acg aga eta ttt gga get gat ect ccg ata etc tac gtt ctt cat tac 1584 

40 Thr Arg Leu Phe Gly Ala Asp Pro Pro He Leu Tyr Val Leu His Tyr 

515 520 525 

eta ggt tac aac aaa cca tgg gta tgc ttc aga gac tac gat tgc aat 1632 

Leu Gly Tyr Asn Lys Pro Trp Val Cys Phe Arg Asp Tyr Asp Cys Asn 
45 530 535 540 

tgg aat gtc gtt gga tac cat caa ttc gcg age gat gaa gca cac aaa 1680 

Trp Asn Val Val Gly Tyr His Gin Phe Ala Ser Asp Glu Ala His Lys 

545 550 555 560 



act tgg tgg aga gtg cac gac gcg atg cct aag aaa ttg cag agg ttt 1728 
Thr Trp Trp Arg Val His Asp Ala Met Pro Lys Lys Leu Gin Arg Phe 
565 570 575 



55 tgt eta ctg agt teg aaa caa aag gcg caa ctt gag tgg gat egg aga 1776 
Cys Leu Leu Ser Ser Lys Gin Lys Ala Gin Leu Glu Trp Asp Arg Arg 
580 585 590 

caa get gag aaa gcg aat tac aga gac gga cat tgg agg att aag ate 1824 
60 Gin Ala Glu Lys Ala Asn Tyr Arg Asp Gly His Trp Arg He Lys He 
595 600 605 
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aaa gat aag aga ctt acg act tgt ttt gaa gat ttc tgt ttc tgg gag 1872 

Lys Asp Lys Arg Leu Thr Thr Cys Phe Glu Asp Phe Cys Phe Trp Glu 
610 615 620 

5 

agt atg ctt tgg cat tgg ggt gag act cag acc aac tec acc gtc get 1920 
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<222> (1) . . (1980) 
<400> 13 

atg gca aac tct ccc get get cct gca ccc acc acc aca acc ggt ggt 48 

5 Met Ala Asn Ser Pro Ala Ala Pro Ala Pro Thr Thr Thr Thr Gly Gly 
15 10 15 

gac tec egg cga cgc etc tec gcg tec ata gaa gca ata tgc aag agg 96 

Asp Ser Arg Arg Arg Leu Ser Ala Ser lie Glu Ala lie Cys Lys Arg 
10 20 25 30 

aga ttc egg aga aat age aaa gga ggt ggc aga teg gat atg gtg aaa 144 

Arg Phe Arg Arg Asn Ser Lys Gly Gly Gly Arg Ser Asp Met Val Lys 
35 40 45 



15 



55 



ccg ttt aat ate ata aat ttt teg aca caa gac aaa aac agt agt tgt 192 
Pro Phe Asn lie lie Asn Phe Ser Thr Gin Asp Lys Asn Ser Ser Cys 
50 55 60 



20 tgt tgt ttc acc aag ttt cag ate gtg aag ctt etc ttg ttt ate ctt 240 
Cys Cys Phe Thr Lys Phe Gin He Val Lys Leu Leu Leu Phe He Leu 
65 70 75 80 

etc tct gee act etc ttc acc att ate tat tct cct gaa get tat cat 2 88 
25 Leu Ser Ala Thr Leu Phe Thr He He Tyr Ser Pro Glu Ala Tyr His 

85 90 95 

cat tct ctt tec cac tea tct tct cga tgg ata tgg aga aga caa gat 33 6 
His Ser Leu Ser His Ser Ser Ser Arg Trp He Trp Arg Arg Gin Asp 
30 100 105 110 

cca cgt tac ttc teg gat ctg gat ata aac tgg gac gat gtg act aaa 3 84 
Pro Arg Tyr Phe Ser Asp Leu Asp He Asn Trp Asp Asp Val Thr Lys 
115 120 125 

35 

acc ctt gag aac ate gaa gaa ggc cgt acg ate ggt gtc ttg aat ttt 432 
Thr Leu Glu Asn He Glu Glu Gly Arg Thr He Gly Val Leu Asn Phe 
130 135 140 

40 gat teg aac gag ate caa cga tgg aga gaa gta tec aag age aag gac 480 
Asp Ser Asn Glu He Gin Arg Trp Arg Glu Val Ser Lys Ser Lys Asp 
145 150 155 160 

aat ggg gat gaa gaa aaa gtt gtt gta ttg aat eta gat tac gca gac 52 8 
45 Asn Gly Asp Glu Glu Lys Val Val Val Leu Asn Leu Asp Tyr Ala Asp 

165 170 175 

aag aat gtg act tgg gac gca eta tat cca gag tgg ate gat gag gag 576 
Lys Asn Val Thr Trp Asp Ala Leu Tyr Pro Glu Trp He Asp Glu Glu 
50 180 185 190 

caa gaa aca gag gtc cct gtt tgt cct aat ate ccg aac att aag gta 624 
Gin Glu Thr Glu Val Pro Val Cys Pro Asn He Pro Asn He Lys Val 
195 200 205 



cct aca aga aga etc gat ctg ate gtc gtg aaa ctt cct tgt egg aaa 672 
Pro Thr Arg Arg Leu Asp Leu He Val Val Lys Leu Pro Cys Arg Lys 
210 215 220 



60 gaa ggg aat tgg teg aga gac gtc ggg aga ttg cat eta cag eta gcg 
Glu Gly Asn Trp Ser Arg Asp Val Gly Arg Leu His Leu Gin Leu Ala 



720 
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225 230 235 240 

get gca act gtg gcg get teg gec aaa ggg ttt ttc agg gga cat gtg 768 
Ala Ala Thr Val Ala Ala Ser Ala Lys Gly Phe Phe Arg Gly His Val 
245 250 255 

ttt ttt gta tct aga tgc ttt ccg att ccg aat ctt ttc egg tgt aaa 816 
Phe Phe Val Ser Arg Cys Phe Pro lie Pro Asn Leu Phe Arg Cys Lys 
260 265 270 

gat ctt gtg tct egg aga ggc gat gtt tgg ttg tac aaa cct aat ctt 864 
Asp Leu Val Ser Arg Arg Gly Asp Val Trp Leu Tyr Lys Pro Asn Leu 
275 280 285 

15 gat ace ttg aga gac aag ctt cag ctg cct gta ggg tct tgt gag eta 912 
Asp Thr Leu Arg Asp Lys Leu Gin Leu Pro Val Gly Ser Cys Glu Leu 
290 295 300 

tct ctt cct ctt ggc ate caa gat agg cca age tta gga aac cct aaa 960 
20 Ser Leu Pro Leu Gly lie Gin Asp Arg Pro Ser Leu Gly Asn Pro Lys 
305 310 315 320 

aga gaa get tac gca aca att ctt cac tea get cac gtt tac gtc tgc 1008 
Arg Glu Ala Tyr Ala Thr lie Leu His Ser Ala His Val Tyr Val Cys 
25 325 330 335 

ggt gca ate gee gcg get cag age ata aga cag tct ggt teg acg aga 1056 

Gly Ala lie Ala Ala Ala Gin Ser lie Arg Gin Ser Gly Ser Thr Arg 

340 345 350 

30 

gac ctt gtt ate ctt gtt gat gac aac ate age ggt tac cac egg agt 1104 

Asp Leu Val lie Leu Val Asp Asp Asn lie Ser Gly Tyr His Arg Ser 
355 360 365 

35 gga eta gaa gee gcg ggt tgg caa ate egg acg ata' cag agg att cga 1152 
Gly Leu Glu Ala Ala Gly Trp Gin lie Arg Thr lie Gin Arg lie Arg 
370 375 380 

aac cct aag gca gag aaa gat get tac aac gaa tgg aac tac age aag 12 00 
40 Asn Pro Lys Ala Glu Lys Asp Ala Tyr Asn Glu Trp Asn Tyr Ser Lys 
385 390 395 400 

ttc egg eta tgg cag ctg act gat tac gac aaa ate att ttc ate gac 12 4 8 
Phe Arg Leu Trp Gin Leu Thr Asp Tyr Asp Lys lie lie Phe lie Asp 
45 405 410 415 

gcg gat etc tta ate ttg aga aac ate gat ttc ttg ttc teg atg cct 129 6 

Ala Asp Leu Leu lie Leu Arg Asn lie Asp Phe Leu Phe Ser Met Pro 

420 425 430 

50 

gag ate tea get aca gga aac aat gga act ctg ttt aat tea gga gtt 13 44 

Glu lie Ser Ala Thr Gly Asn Asn Gly Thr Leu Phe Asn Ser Gly Val 
435 440 445 

55 atg gtg ate gag cct tgc aac tgt acg ttt cag ctt ctg atg gaa cat 13 92 
Met Val He Glu Pro Cys Asn Cys Thr Phe Gin Leu Leu Met Glu His 
450 455 460 

ata aac gag att gag tct tat aac ggt gga gat caa ggt tac tta aac 1440 
60 He Asn Glu He Glu Ser Tyr Asn Gly Gly Asp Gin Gly Tyr Leu Asn 
465 470 475 480 
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gag gta ttc aca tgg tgg cac egg att cca aaa cat atg aat ttc ttg 1488 
Glu Val Phe Thr Trp Trp His Arg lie Pro Lys His Met Asn Phe Leu 
485 490 495 

5 

aag cat ttt tgg att ggc gat gaa gat gac gcg aaa cgc aag aaa aca 1536 
Lys His Phe Trp lie Gly Asp Glu Asp Asp Ala Lys Arg Lys Lys Thr 
500 505 510 

10 gag ctt ttt gga gca gag cct cct gtt ctt tat gtt ctt cat tac ctt 1584 
Glu Leu Phe Gly Ala Glu Pro Pro Val Leu Tyr Val Leu His Tyr Leu 
515 520 525 

ggg atg aag ccg tgg tta tgt tac cgt gac tac gac tgt aac ttc aac 163 2 
15 Gly Met Lys Pro Trp Leu Cys Tyr Arg Asp Tyr Asp Cys Asn Phe Asn 
530 535 540 

tec gac ata ttc gtt gag ttt get acc gat ate get cat cga aaa tgg 1680 
Ser Asp lie Phe Val Glu Phe Ala Thr Asp lie Ala His Arg Lys Trp 
20 545 550 555 560 

tgg atg gtc cac gac gec atg cca cag gaa ctt cac caa ttc tgt tac 172 8 
Trp Met Val His Asp Ala Met Pro Gin Glu Leu His Gin Phe Cys Tyr 
565 570 575 

25 

ttg cga tec aag caa aag gca cag ctg gaa tat gat cgc egg caa gca 1776 
Leu Arg Ser Lys Gin Lys Ala Gin Leu Glu Tyr Asp Arg Arg Gin Ala 
580 585 590 

30 gag gec gca aat tat gee gac ggt cat tgg aaa ata aga gta aag gac 1824 
Glu Ala Ala Asn Tyr Ala Asp Gly His Trp Lys lie Arg Val Lys Asp 
595 600 605 

ccg aga ttc aaa att tgc ate gac aaa tta tgt aat tgg aaa agt atg 1872 
35 Pro Arg Phe Lys He Cys He Asp Lys Leu Cys Asn Trp Lys Ser Met 
610 615 620 

ctg egg cat tgg ggc gaa tea aat tgg act gac tac gag tct ttt gtt 192 0 
Leu Arg His Trp Gly Glu Ser Asn Trp Thr Asp Tyr Glu Ser Phe Val 
40 625 630 635 640 

ccc acc cca cca gee att acc gta gac egg aga tea tea ctt ccc ggc 196 8 
Pro Thr Pro Pro Ala He Thr Val Asp Arg Arg Ser Ser Leu Pro Gly 
645 650 655 

45 

cat aac ttg tga cgcaataatt atacatactt attaatggat ttcatgagtt 2 02 0 

His Asn Leu 

660 

50 ttttggtttg aattgttgct gcgagattag gtgaatatca gttgtgtaac tatatctttt 2080 
tcctatagtt tgttcaaatt gaataaaaca tttttttgea gtttaaccac 213 0 

55 <210> 14 
<211> 659 
<212> PRT 

<213> Arabidopsis thaliana 
60 <400> 14 

Met Ala Asn Ser Pro Ala Ala Pro Ala Pro Thr Thr Thr Thr Gly Gly 



WO 2004/078983 





1 








5 










Asp 


Ser 


Arg 


Arg 


Arg 


Leu 


Ser 


Ala 










20 












Arg 


Phe 


Arg 


Arg Asn 


Ser 


Lys 


Gly 


5 






35 










40 




Pro 


Phe 


Asn 


He 


He 


Asn 


Phe 


Ser 






50 










55 






Cys 


Cys 


Phe 


Thr 


Lys 


Phe 


Gin 


He 




65 










70 






10 


Leu 


Ser 


Ala 


Thr 


Leu 


Phe 


Thr 


He 












85 










His 


Ser 


Leu 


Ser 


His 


Ser 


Ser 


Ser 










100 












Pro 


Arg 


Tyr 


Phe 


Ser 


Asp 


Leu 


Asp 


15 






115 










120 




Thr 


Leu 


Glu 


Asn 


He 


Glu 


Glu Gly 






130 










135 






Asp 


Ser 


Asn 


Glu 


He 


Gin 


Arg 


Trp 




145 










150 






20 


Asn 


Gly 


Asp 


Glu 


Glu 


Lys 


Val 


Val 












165 










Lys 


Asn 


Val 


Thr Trp 


Asp 


Ala 


Leu 










180 












Gin 


Glu 


Thr 


Glu 


Val 


Pro 


Val 


Cys 


25 






195 










200 




Pro 


Thr 


Arg 


Arg Leu 


Asp 


Leu 


He 






210 










215 






Glu 


Gly 


Asn 


Trp 


Ser 


Arg 


Asp Val 




225 










230 






30 


Ala 


Ala 


Thr 


Val 


Ala 


Ala 


Ser 


Ala 












245 










Phe 


Phe 


Val 


Ser 


Arg 


Cys 


Phe 


Pro 










260 












Asp 


Leu 


Val 


Ser 


Arg 


Arg 


Gly Asp 


35 






275 










280 




Asp 


Thr 


Leu 


Arg 


Asp 


Lys 


Leu 


Gin 






290 










295 






Ser 


Leu 


Pro 


Leu Gly 


He 


Gin Asp 




305 










310 






40 


Arg 


Glu 


Ala 


Tyr 


Ala 


Thr 


He 


Leu 












325 










Gly 


Ala 


He 


Ala 


Ala 


Ala 


Gin 


Ser 










340 












Asp 


Leu 


Val 


He 


Leu 


Val 


Asp 


Asp 


45 






355 










360 




Gly 


Leu 


Glu 


Ala 


Ala 


Gly 


Trp 


Gin 






370 










375 






Asn 


Pro 


Lys 


Ala 


Glu 


Lys 


Asp 


Ala 




385 










390 






50 


Phe 


Arg 


Leu 


Trp 


Gin 


Leu 


Thr 


Asp 












405 










Ala 


Asp 


Leu 


Leu 


He 


Leu 


Arg 


Asn 










420 












Glu 


lie 


Ser 


Ala 


Thr 


Gly 


Asn 


Asn 


55 






435 










440 




Met 


Val 


He 


Glu 


Pro 


Cys 


Asn 


Cys 






450 










455 






lie 


Asn 


Glu 


He 


Glu 


Ser 


Tyr 


Asn 




465 










470 






60 


Glu 


Val 


Phe 


Thr 


Trp 


Trp 


His 


Arg 



485 



PCT/EP2004/002096 



29 





10 










15 




Ser 


He 


Glu 


Ala 


He 


Cys 


Lys 


Arg 


25 










30 






Gly 


Gly Arg Ser Asp Met 


Val 


Lys 










45 








Thr 


Gin 


Asp 


Lys 


Asn 


Ser 


Ser 


Cys 








60 










Val 


Lys 


Leu 


Leu 


Leu 


Phe 


He 


Leu 






75 










80 


He 


Tyr 


Ser 


Pro 


Glu 


Ala 


Tyr 


His 




90 










95 




Arg 


Trp 


He 


Trp 


Arg 


Arg 


Gin 


Asp 


105 










110 






He 


Asn 


Trp 


Asp 


Asp 


Val 


Thr 


Lys 










125 








Arg 


Thr 


He 


Gly Val 


Leu 


Asn 


Phe 








140 










Arg 


Glu 


Val 


Ser 


Lys 


Ser 


Lys 


Asp 






155 










160 


Val 


Leu 


Asn 


Leu 


Asp 


Tyr 


Ala 


Asp 




170 










175 




Tyr 


Pro 


Glu 


Trp 


He 


Asp 


Glu 


Glu 


185 










190 






Pro 


Asn 


He 


Pro 


Asn 


He 


Lys 


Val 










205 








Val 


Val 


Lys 


Leu 


Pro 


Cys 


Arg 


Lys 








220 










Gly 


Arg 


Leu 


His 


Leu 


Gin 


Leu 


Ala 






235 










240 


Lys 


Gly 


Phe 


Phe 


Arg 


Gly 


His 


Val 




250 










255 




He 


Pro 


Asn 


Leu 


Phe 


Arg 


Cys 


Lys 


265 










270 






Val 


Trp 


Leu 


Tyr 


Lys 


Pro 


Asn 


Leu 










285 








Leu 


Pro 


Val 


Gly 


Ser 


Cys 


Glu 


Leu 








300 










Arg 


Pro 


Ser Leu Gly Asn 


Pro 


Lys 






315 










320 


His 


Ser 


Ala 


His 


Val 


Tyr 


Val 


Cys 




330 










335 




He 


Arg 


Gin 


Ser Gly 


Ser 


Thr 


Arg 


345 










350 






Asn 


He 


Ser 


Gly Tyr 


His 


Arg 


Ser 










365 








He 


Arg 


Thr 


He 


Gin 


Arg 


He 


Arg 








380 










Tyr 


Asn 


Glu 


Trp 


Asn 


Tyr 


Ser Lys 






395 










400 


Tyr 


Asp 


Lys 


He 


He 


Phe 


He 


Asp 




410 










415 




He 


Asp 


Phe 


Leu 


Phe 


Ser 


Met 


Pro 


425 










430 






Gly 


Thr 


Leu 


Phe 


Asn 


Ser 


Gly Val 










445 








Thr 


Phe 


Gin 


Leu 


Leu 


Met 


Glu 


His 








460 










Gly 


Gly Asp 


Gin 


Gly 


Tyr 


Leu 


Asn 






475 










480 


lie 


Pro 


Lys 


His 


Met 


Asn 


Phe 


Leu 




490 










495 





WO 2004/078983 



PCT/EP2004/002096 



30 



Lys 


His 


Phe. 


Trp 
500 


He 


Gly 


Asp 


Glu Asp 
505 


Asp 


Ala 


Lys 


Arg 


Lys 
510 


Lys 


Thr 


Glu 


Leu 


Phe 
515 


Gly 


Ala 


Glu 


Pro 


Pro Val 
520 


Leu 


Tyr 


Val 


Leu 
525 


His 


Tyr 


Leu 


Gly 


Met 
530 
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Pro 


Trp 
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Cys 
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He 
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He 
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Trp 


545 










550 








555 
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Trp 


Met 


Val 
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Glu 
570 


Leu 


His 


Gin 


Phe 


Cys 
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Arg 
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Lys 
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Gin 


Lys 
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Gin Leu 
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Asp 


Arg 


Arg 
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Asn 


Tyr 


Ala 


Asp 


Gly His 
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Trp 


Lys 


He 


Arg 
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Val 
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Asp 
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Arg 
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Lys 


He 


Cys 


He Asp Lys 


Leu 


Cys 


Asn 


Trp 


Lys 


Ser 


Met 
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Leu 


Arg 


His 


Trp 
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Glu 


Ser 


Asn Trp 


Thr 


Asp 
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Glu 


Ser 


Phe 


Val 
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630 








63 5 
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Pro 


Thr 


Pro 


Pro 
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He 
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Val Asp 
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Arg 
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Ser 
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25 <210> 15 

<211> 15294 
<212> DNA 

<213> Artificial Sequence 
30 <22 0> 



<220> 

<223> Description of Artificial Sequence: 



vector 



35 



40 



45 



50 



55 



60 



<400> 15 

ggccgggagg 

ggcgcagccc 

aggtta.gcgg 

ggacagcccc 

tgtcaaggat 

ggcacttatc 

ttcgccgatt 

atctgtcaac 

gtcagtgagg 

acacggcttc 

ggcgagggca 

tgatgtactt 

tggcaatcac 

gggcggacca 

agcagggcga 

agtttcagca 

tgctcatagt 

gacaggctca 

cagtacacct 

ttgccctcat 

agcaccgcca 

acttcaccta 

ctttggcaaa 

atgaccccga 

ggacaggtat 

gggaaacgcc 



gttcgagaag 
tggttaaaaa 
tggccgaaaa 
tcaaatgtca 
cgcgcccctc 
cccaggcttg 
tgcgaggctg 
gccgcgccgg 
gccaagtttt 
gacggcgttt 
accagcccgg 
caccagctcc 
gcgcaccccc 
cgcccatcat 
ggatcgtggc 
ggccgcccag 
ccacgacgcc 
tgccggccgc 
tgataggtgg 
ctgttacgcc 
ggtgcgaata 
tcctgcccgg 
atcctgtata 
agcagggtta 
ccggtaagcg 
tggtatcttt 



ggggggcacc 

caaggtttat 
acgggcggaa 
ataggtgcgc 
atctgtcagt 
tccacatcat 
gccagctcca 
gtgagtcggc 
ccgcgaggta 
ctggcgcgtt 
tgagcgtcgc 
gcgaagtcgc 
cggccgtttt 
gaccttgcca 
atcaccgaac 
gcggcccagg 
cgtgattttg 
cgccgccttt 
gctgcccttc 
ggcggtagcc 
agggacagtg 
ctgacgccgt 
tcgtgcgaaa 
tgcagcggaa 
gcagggtcgg 
atagtcctgt 



ccccttcggc 
aaatattggt 
acccttgcaa 
ccctcatctg 
agtcgcgccc 
ctgtgggaaa 
cgtcgccggc 
ccctcaagtg 
tccacaacgc 
tgcagggcca 
aaaggcgctc 
tcttcttgat 
agcggctaaa 
agctcgtcct 
cgcgccgtgc 
tcgccattga 
tagccctggc 
tcctcaatcg 
ctggttggct 
ggccagcctc 
aagaaggaac 
tggatacacc 
aaggatggat 
aagcgccacg 
aacaggagag 
cgggtttcgc 



gtgcgcggtc 
ttaaaagcag 
atgctggatt 
tcagcactct 
ctcaagtgtc 
ctcgcgtaaa 
cgaaatcgag 
tcaacgtccg 
cggcggccgc 
tagacggccg 
ggtcttgcct 
ggagcgcatg 
aaagtcatgg 
gcttctcttc 
gcgggtcgtc 
tgcgggccag 
cgacggccag 
ctcttcgttc 
tggtttcatc 
gcagagcagg 
acccgctcgc 
aaggaaagtc 
ataccgaaaa 
cttcccgaag 
cgcacgaggg 
cacctctgac 



acgcgcacag 
gttaaaagac 
ttctgcctgt 
gcccctcaag 
aataccgcag 
atcaggcgtt 
cctgcccctc 
cccctcatct 
ggtgtctcgc 
ccagcccagc 
tgctcgtcgg 
gggacgtgct 
ctctgccctc 
gatcttcgcc 
ggtgagccag 
ctcgcggacg 
caggtaggcc 
gtctggaagg 
agccatccgc 
attcccgttg 

gggtgggcct 
tacacgaacc 
aatcgctata 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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atttttgtga tgctcgtcag gggggcggag 
tttacggttc ctggcctttt gctggccttt 
tgattctgtg gataaccgta ttaccgcctt 
aacgaccgag cgcagcgagt cagtgagcga 
5 agaggccgag cgcggccgtg aggcttggac 
gggctgctac gggcgtctga cgcggtggaa 
gctgtagtga gtgggttgcg ctccggcagc 
ccttcaacgt tcctgacaac gagcctcctt 
ccctgctcga acgctgcgtc cggaccggct 
10 agcggcgaga gcggagcctg ttcaacggtg 
gcctgctcct caagcacggc cccaacagtg 
gcgtccccgg ccgaaaaacc cgcctcgcag 
atctgcggtg cgcccggtcg cgtgccggca 
agcgcctgcc tgaagctgcg ggcattcccg 

15 ctcggcaccg aatgcgtatg attctccgcc 
gcccgcttgt tcctgaagtg ccagtaaagc 
agtttgcgtg tcgtcagacc gtctacgccg 
atcactgtat tcggctgcaa ctttgtcatg 
tgtccaccaa cttatcagtg ataaagaatc 

20 tccggaggcc agacgtgaaa cccaacatac 
tcgacgctgt cggcatcggc ctgattatgc 
ttcactcgaa cgacgtcacc gcccactatg 
aatttgcctg cgcacctgtg ctgggcgcgc 
tgctcgtctc gctggccggc gccaagatct 

25 aatggacgaa cggataaacc ttttcacgcc 
ctcttttctc ttaggtttac ccgccaatat 
aggcgggaaa cgacaatctg atcatgagcg 
gccgatgacg cgggacaagc cgttttacgt 
agccactcag ccgatctgaa ttcccgatct 

30 ttatcctagt ttgcgcgcta tattttgttt 
actctaatca taaaaaccca tctcataaat 
tgcttaacgt aattcaacag aaattatatg 
aatcttaaga aactttattg ccaaatgttt 
atcatgttac aaactttttt gctgtgagca 

35 atctgataga taaagcacat agcttgggtt 
atgcaagcaa taatgccact gatggcctca 
aatcctactt tcatgctgga gagcaagggt 
actatgccaa ttgggaggtt cttcgtttcc 
agtataactt tgacggattt cgatttgatg 

40 gaatcaatat gggatttaca ggaaactata 
atgctgtggt ctatttaatg ttggccaata 
ctgttattgc cgaagatgtt tctggtatgc 
gaattggttt tgattaccgc ctggcaatgg 
agaataaaaa tgatgaagat tggtccatga 

45 gatatacaga gaagtgtata gcatatgcgg 
agaccattgc atttctccta atggacaaag 
atgcttctcc tgttattgat cgaggaattg 
tggccttggg aggagagggg tacctcaatt 
ggattgactt ccctagagag ggcaataatt 

50 accttgcgga tagcgaacac ttgagataca 
attcgctcga tgaaaagttc tcattcctcg 
atgatgagaa gaaggttgtt gtgtttgaac 
acccaaataa cacatacgaa gggtataaag 
ttgcactgga cagtgatgct tgggaatttg 

55 accatttcac atcaccagaa ggaatacctg 
caaattcctt caaagtgctg tctcctgcgc 
aacgcatgtc agaaactgaa gtttaccaga 
ccaatatcga ggagagtgac gagaaactta 
ttgacgaact catgtcagaa actgaagttt 

60 caacagccag tatcgaggag agtgacgaga 
gtacgtggtt atcattggat gtgggattcc 
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cctatggaaa aacgccagca acgcggcctt 1620 
tgctcacatg ttctttcctg cgttatcccc 1680 
tgagtgagct gataccgctc gccgcagccg 1740 
ggaagcggaa gagcgccaga aggccgccag 1800 
gctagggcag ggcatgaaaa agcccgtagc 1860 
agggggaggg gatgttgtct acatggctct 1920 
ggtcctgatc aatcgtcacc ctttctcggt 1980 
ttcgccaatc catcgacaat caccgcgagt 2 040 
tcgtcgaagg cgtctatcgc ggcccgcaac 2100 
ccgccgcgct cgccggcatc gctgtcgccg 2160 
aagtagctga ttgtcatcag cgcattgacg 222 0 
aggaagcgaa gctgcgcgtc ggccgtttcc 2280 
tggatgcgcg cgccatcgcg gtaggcgagc 2340 
atcagaaatg agcgccagtc gtcgtcggct 2400 
agcatggctt cggccagtgc gtcgagcagc 2460 
gccggctgct gaacccccaa ccgttccgcc 2520 
acctcgttca acaggtccag ggcggcacgg 2580 
cttgacactt tatcactgat aaacataata 2640 
cgcgcgttca atcggaccag cggaggctgg 2700 
ccctgatcgt aattctgagc actgtcgcgc 2760 
cggtgctgcc gggcctcctg cgcgatctgg 282 0 
gcattctgct ggcgctgtat gcgttggtgc 2880 
tgtcggatcg tttcgggcgg cggccaatct 2940 
ggggaaccct gtggttggca tgcacataca 3 000 
cttttaaata tccgattatt ctaataaacg 3 060 
atcctgtcaa acactgatag tttaaactga 312 0 
gagaattaag ggagtcacgt tatgaccccc 3180 
ttggaactga cagaaccgca acgttgaagg 3240 
agtaacatag atgacaccgc gcgcgataat 33 00 
tctatcgcgt attaaatgta taattgcggg 33 60 
aacgtcatgc attacatgtt aattattaca 3420 
ataatcatcg caagaccggc aacaggattc 3480 
gaacgatcgg ggaaattcga gctcggtacc 3540 
gtagatatgg aaacccggag gacctaaagt 3600 
tacaggttct ggtggatgta gttcacagtc 3 660 
atggctttga tattggccaa ggttctcaag 372 0 
accataagtt gtgggatagc aggctgttca 3780 
ttctttccaa cttgaggtgg tggctagaag 3 840 
gaataacttc tatgctgtat gttcatcatg 3900 
atgagtattt cagcgaggct acagatgttg 396 0 
atctgattca caagattttc ccagacgcaa 402 0 
cgggccttgg ccggcctgtt tctgagggag 4080 
caatcccaga taagtggata gattatttaa 4140 
aggaagtaac atcgagtttg acaaatagga 42 00 
agagccatga tcagtctatt. gtcggtgaca 4260 
agatgtattc tggcatgtct tgcttgacag 432 0 
cgcttcacaa gatgatccat tttttcacaa 43 80 
tcatgggtaa cgagtttggc catcctgagt 4440 
ggtgttatga caaatgtaga cgccagtgga 4500 
agtttatgaa tgcatttgat agagctatga 4560 
catcaggaaa acagatagta agcagcatgg 462 0 
gtggtgacct ggtatttgta ttcaacttcc 4680 
ttggatgtga cttgccaggg aagtacagag 4740 
gtggccatgg aagagctggt catgatgttg 4800 
gagttccaga aacaaatttc aatggtcgtc 4860 
gaacatgtgt ggcttattac agagttgacg 492 0 
cagacatttc tagtgagcta ctaccaacag 4980 
aagattcgtt atctacaaat atcagtaacg 5040 
accagacaga catttctagt gagctactac 5100 
aacttaaaga ttcattatct acaaatatca 5160 
cgcctcttta attatggaaa ctgggaggta 522 0 
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cttaggtatc ttctctcaaa tgcgagatgg 
agattcgatg gtgtgacatc aataatgtat 
gggaactaca aggaatactt tggactcgca 
ctggtcaacg atcttattca tgggcttttc 
5 gcg;gaatgcc gacattttgt attcccgttc 
tgcatatggc aattgctgat aaatggattg 
gagtgggtga tattgttcat acactgacaa 
acgctgaaag tcatgatcaa gctctagtcg 
acaaggatat gtatgatttt atggctctgg 
10 ggatagcatt gcacaagatg attaggcttg 
taaatttcat gggaaatgaa ttcggccacc 
aacacctctc tgatggctca gtaattcccg 
ggagatttga cctgggagat gcagaatatt 
gggctatgca gtatcttgaa gataaatatg 
15 cacgaaagga tgaaggagat aggatgattg 
ttaattttca ctggacaaaa agctattcag 
aatacaaggt tgccttggac tcagatgatc 
ataatgccga atatttcacc tttgaaggat 
tgtatgcacc tagtagaaca gcagtggtct 
20 aagaagaagt agcagtagta gaagaagtag 
cgcgttgaaa gatttgaagg ctacatagct 
attggaggag atgagtaaaa gttaccactt 
gaggaggtgc ctgccttatt tgtagcaggt 
tggctatccc ttagcagaag gcaactgtgg 
25 agtattatgg gccctctctt tgttgattca 
ggcccgtccg gttctgtctc ctagtatcta 
ccacttgaca ttcctatgtc tcgtgttaat 
atctaggtac tggtactggt ccctccctcc 
ttgtattcca aattactgta aatatatttt 
30 atgaatgcac aattctaaag gttgaataca 
tgcattttag tattaagatt gagatgcatg 
ggaaggatgg ttaaaaacaa cattcaatgt 
tttaacgagc ttcccgtata catcataaca 
atacccaaac aggaagatac tgtcaagtat 
35 tttcgaggat tcataaatca taatatctgg 
tatcctgatt actccgtcaa cagccaaata 
tacaagatac tgttgcattt gttaagtaat 
tcgttaaagc ttcacgctgc cgcaagcact 
cacgtagaaa gccagtccgc agaaacggtg 
40 tatctggaca agggaaaacg caagcgcaaa 
atggcgatag ctagactggg cggttttatg 
ggcgccctct ggtaaggttg ggaagccctg 
aaggatctga tggcgcaggg gatcaagatc 
gacccccgcc gatgacgcgg gacaagccgt 
45 ttgaaggagc cactcagccg cgggtttctg 
aaccattatt gcgcgttcaa aagtcgccta 
tcaaaaatgc tccactgacg ttccataaat 
tcactctcaa tccagatctc gactctagtc 
atggattgca cgcaggttct ccggccgctt 
50 cacaacagac aatcggctgc tctgatgccg 
cggttctttt tgtcaagacc gacctgtccg 
cgcggctatc gtggctggcc acgacgggcg 
ctgaagcggg aagggactgg ctgctattgg 
cfccaccttgc tcctgccgag aaagtatcca 
55 cgcttgatcc ggctacctgc ccattcgacc 
gtactcggat ggaagccggt cttgtcgatc 
tcgcgccagc cgaactgttc gccaggctca 
tcgtgaccca tggcgatgcc tgcttgccga 
gattcatcga ctgtggccgg ctgggtgtgg 
60 cccgtgatat tgctgaagag cttggcggcg 
gtatcgccgc tcccgattcg cagcgcatcg 
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tggttggatg agttcaaatt tgatggattt 5280 
actcaccacg gattatcggt gggattcact 53 40 
actgatgtgg atgctgttgt gtatctgatg 5400 
cagatgcaat taccattggt gaagatgtta 5460 
aagatggggg tgttggcttt gactatcggc 552 0 
agttgctcaa gaaacgggat gaggattgga 5580 
atagaagatg gtcggaaaag tgtgtttcat 5640 
gtgataaaac tatagcattc tggctgatgg 5700 
atagaccatc aacatcatta atagatcgtg 5760 
taactatggg attaggagga gaagggtacc 5820 
ctgagtggat tgatttccct agggctgaac 5880 
gaaaccaatt cagttatgat aaatgcagac 5940 
taagataccg tgggttgcaa gaatttgacc 6000 
agtttatgac ttcagaacac cagttcatat 6060 
tatttgaaaa aggaaaccta gtttttgtct 612 0 
actatcgcat aggctgcctg aagcctggaa 6180 
cactttttgg tggcttcggg agaattgatc 6240 
ggtatgatga tcgtcctcgt tcaattatgg 63 00 
atgcactagt agacaaagaa gaagaagaag 6360 
tagtagaaga agaatgaacg aacttgtgat 642 0 
ctagagtcga cctgcatgaa atcagaaata 6480 
gttgagctgt gtgagtgagt gagtgagaat 6540 
ttcagtgaca cgtgtcaaga gaatagcggg 6600 
acactgtatt atagggaaat gctcatcgac 6660 
cggctggact tcaacttggg ccttgcaatg 6720 
aaaaactaaa ccaactccct cctaccgcta 6780 
taaattatta ttatagtaat taaaaataat 6840 
actagaatat tagttacttc ccccttagct 6900 
ctaatttttt acgacaaaca agatctaatt 6960 
ttactttact tggtttagcc tatattaagt 702 0 
gttctattac aaaattgata cactgctaaa 7080 
ttgttacatt tcttcctatt gtattttttt 7140 
tgtctccgtt ccacttggca ggaaaaaaaa 7200 
atccatagat gaggacttaa tggataggct 7260 
cggaggagtc aattaaatac ttgtggtttg 7320 
gaaaagtttg aaaagagaga aaggatttgg 73 80 
gaacaaaacg gagtaacata attttctatc 7440 
cagggcgcaa gggctgctaa ggaagcggaa 7500 
ctgaccccgg atgaatgtca gctactgggc 7560 
gagaaagcag gtagcttgca gtgggcttac 7620 
gacagcaagc gaaccggaat tgccagctgg 7680 
caaagtaaac tggatggctt tcttgccgcc 7740 
atgagcggag aattaaggga gtcacgttat 7800 
tttacgtttg gaactgacag aaccgcaacg 7860 
gagtttaatg agctaagcac atacgtcaga 7920 
aggtcactat cagctagcaa atatttcttg 7980 
tcccctcggt atccaattag agtctcatat 8040 
gagggcccat gggagcttgg attgaacaag 8100 
gggtggagag gctattcggc tatgactggg 8160 
ccgtgttccg gctgtcagcg caggggcgcc 8220 
gtgccctgaa tgaactgcag gacgaggcag 82 80 
ttccttgcgc agctgtgctc gacgttgtca 8340 
gcgaagtgcc ggggcaggat ctcctgtcat 8400 
tcatggctga tgcaatgcgg cggctgcata 8460 
accaagcgaa acatcgcatc gagcgagcac 8520 
aggatgatct ggacgaagag catcaggggc 8580 
aggcgcgcat gcccgacggc gaggatctcg 8640 
atatcatggt ggaaaatggc cgcttttctg 87 00 
cggaccgcta tcaggacata gcgttggcta 87 60 
aatgggctga ccgcttcctc gtgctttacg 8820 
ccttctatcg ccttcttgac gagttcttct 8880 
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gagcgggacc caagctagct tcgacggatc 
atttatgtat tacacataat atcgcactca 
tataatcagt tattgaaata tttctgaatt 
cttggactat aatacctgac ttgttatttt 
5 aagatgggaa ttaattcact ggccgtcgtt 
gttacccaac ttaatcgcct tgcagcacat 
gaggcccgca ccgatcgccc ttcccaacag 
cgctttcttc ccttcctttc tcgccacgtt 
ggggctccct ttagggttcc gatttagtgc 

10 tttgggtgat ggttcacgta gtgggccatc 
gttggagtcc acgttcttta atagtggact 
tatctcgggc tattcttttg atttataagg 
caggattttc gcctgctggg gcaaaccagc 
caggcggtga agggcaatca gctgttgccc 

15 gtacattaaa aacgtccgca atgtgttatt 
acaatatatc ctgccaccag ccagccaaca 
accactcgat acaggcagcc catcagtccg 
gcggcagact ttgctcatgt taccgatgct 
tttgaaacac ggatgatctc gcggagggta 

20 10020 

gctgcctgtg atcaaatatc atctccctcg 
10080 

atttctcgct taaccgtgac aggctgtcga 
10140 

25 tcgctggata aagccgctga ggaagctgag 
10200 

cgatatcaac tcccctatcc attgctcacc 
10260 

gactgtcggc ctgatgcatc cccggctgat 
30 10320 

agcccagtaa ggaaacaact gtaggttcga 
10380 

aggttaaacc cgctccgatc aggccgagcc 
10440 

35 gcatcgggat tggcggatca aacactaaag 
10500 

ccagttgcca ggcggtaaag gtgagcagag 
10560 

cggttccgaa cgccatggaa accgcccccg 
40 10620 

gcgctgcgtt tggtgtcaac accaacagcg 
10680 

ggaccgccat caatcgtatc gggctaccta 
10740 

45 gcggctgcac agcgcctacc gtcgccgcga 
10800 

acaacaagct ccagaatagc gaaatattaa 
10860 

agattcccgt tggaatctgt cggacgatca 
50 10920 

gcagcagcat accggcgacc cctcggcctc 
10980 

gatgcgcctt gtgagcgtcc ttggggccgt 
11040 

55 -tctcgccgtc gatgtaggcg ccgaatgcca 
11100 

ccatgggctt tttctcctcg tgctcgtaaa 
11160 

gggccgacaa tcggatctcg cggaaatcct 
60 11220 
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ccccgatgag ctaagctagc tatatcatca 8940 
gtctttcatc tacggcaatg taccagctga 9 000 
taaacttgca tcaataaatt tatgtttttg 9060 
atcaataaat atttaaacta tatttctttc 9120 
ttacaacgtc gtgactggga aaaccctggc 9180 
ccccctttcg ccagctggcg taatagcgaa 9240 
ttgcgcagcc tgaatggcgc ccgctccttt 93 00 
cgccggcttt ccccgtcaag ctctaaatcg 9360 
tttacggcac ctcgacccca aaaaacttga 9420 
gccctgatag acggtttttc gccctttgac 9480 
cttgttccaa actggaacaa cactcaaccc 9540 
gattttgccg atttcggaac caccatcaaa 9600 
gtggaccgct tgctgcaact ctctcagggc 9660 
gtctcactgg tgaaaagaaa aaccacccca 9720 
aagttgtcta agcgtcaatt tgtttacacc 9780 
gctccccgac cggcagctcg gcacaaaatc 9840 
ggacggcgtc agcgggagag ccgttgtaag 9900 
attcggaaga acggcaacta agctgccggg 9960 
gcatgttgat tgtaacgatg acagagcgtt 

cagagatccg aattatcagc cttcttattc 

tcttgagaac tatgccgaca taataggaaa 

tggcgctatt tctttagaag tgaacgttga 

gaatggtaca ggtcggggac ccgaagttcc 

cgaccccaga tctagatctg gggctgagaa 

gtcgcgagat cccccggaac caaaggaagt 

acgccaggcc gagaacattg gttcctgtag 

ctactggaac gagcagaagt cctccggccg 

gcacgggagg ttgccacttg cgggtcagca 

ccaggcccgc tgcgacgccg acaggatcta 

ccacgcccgc agttccgcaa atagccccca 

gcagagcggc agagatgaac acgaccatca 

ccccgcccgg caggcggtag accgaaataa 

gtgcgccgag gatgaagatg cgcatccacc 

tcacgagcaa taaacccgcc ggcaacgccc 

gctgttcggg ctccacgaaa acgccggaca 

cctcctgttt gaagaccgac agcccaatga 

cggcatctcg caaccgttca gcgaacgcct 

cggacccgaa catctctgga gctttcttca 

gcacgtcggc cgctccaagc cgtcgaatct 
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gagccttaat cacaattgtc aattttaatc 
11280 

cgtgcgtccc gagcgatact gagcgaagca 
11340 

5 aaatgccagt aaagcgctgg cfcgctgaacc 
11400 

gcgtttgcaa tgcaccaggt catcattgac 
11460 

caactcttcg caggcttcgc cgacctgctc 
10 11520 

tccgcacatg aggcggaagg tttccagctt 
11580 

atagtcgaac atccgtcggg ccgtcggcga 
11640 

15 cgtgtagtgg tcgccagcaa acagcacgac 
11700 

ggacgttttc ttgccacggt ccaggacgcg 
11760 

gtgcccaacg cggtcggacg tgaagcccat 
20 11820 

ctcggccttc gtgtaatacc ggccattgat 
11880 

gaacgtgaag gtgatcggct cgccgatagg 
11940 

25 ccacaccagt tcgtcatcgt cggcccgcag 
12000 

cttgttgacg tggaaaatga ccttgttttg 
12060 

cgtggtgaac agggcagagc gggccgtgtc 
30 12120 

cggcgcaata tcgaacaagg aaagctgcat 
12180 

caacgcggcc tgcttggcct cgctgacctg 
12240 

35 cttcttggtc gtcatagttc ctcgcgtgtc 
12300 

ctcctgttcg agacgacgcg aacgctccac 
12360 

agccagttgc acgctgtcgc gctcgatctt 
40 12420 

ggactggaag gtttcgcggg gcgcacgcat 
12480 

ctcggcggaa aaccccgcgt cgatcagttc 
12540 

45 attcattcac cctccttgcg ggattgcccc 
12600 

ctgatttgac ccgcctggtg ccttggtgtc 
12660 

ggtcccgtag accgtctggc cgtccttctc 
50 12720 

gaataccagc gaccccttgc ccaaatactt 
12780 

cttgatgcgg aagaagtcgg tgcgctcctg 
12840 

55 ggtactaaaa caattcatcc agtaaaatat 
12900 

tccccagtaa gtcaaaaaat agctcgacat 
12960 

accggacgca gaaggcaatg tcataccact 
60 13 020 
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taaagccact tactttgcca tctttcacaa 
13080 

aaaagacaag ttcctcttcg ggcttttccg 
13140 

5 ctttaaatgg agtgtcttct tcccagtttt 
13200 

gtaagtaatc caattcggct aagcggctgt 
13260 

tgtcgatgga gtgaaagagc ctgatgcact 
10 13320 

tttgttcatc ttcatactct tccgagcaaa 
13380 

tgctccagcc atcatgccgt tcaaagtgca 
13440 

15 gccatagcat catgtccttt tcccgttcca 
13500 

ccgtcatttt taaatatagg ttttcatttt 
13560 

acattccttc cgtatctttt acgcagcggt 
20 13620 

atattctcat tttagccatt tattatttcc 
13680 

ccaagaagct aattataaca agacgaactc 
13740 

25 taaataccag aaaacagctt tttcaaagtt 
13800 

gacggagccg attttgaaac cacaattatg 
13860 

atgatggtgt ttttgaggtg ctccagtggc 
30 13920 

agctactgac ggggtggtgc gtaacggcaa 
13980 

ctctcactgc cgtaaaacat ggcaactgca 
14040 

35 gcaccagaaa atcattgata tggccatgaa 
14100 

tatgggcgtt ggcctcaaca cgattttacg 
14160 

acctcgcgca tacagccggg cagtgacgtc 
40 14220 

ggctatgtcg gggctaaatc gcgccagcgc 
14280 

aagacggttg ttgcg : cacgt attcggtgaa 
14340 

45 agcctgctgt caccctttga cgtggtgata 
14400 

tcccgcctga agggaaagct gcacgtaatc 
14460 

cataacctga atctgaggca gcacctggca 
50 14520 

aaatcggtgg agctgcatga caaagtcatc 
14580 

taagttggag tcattaccca attatgatag 
14640 

55 gtttcaagca ttagtccatg caagttttta 
14700 

agcgcgctgc ctatgccttg ccccctgaaa 
14760 

aagatatatt atcttatcag tattgtcaat 
60 14820 
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agatgttgct 


gtctcccagg 


tcgccgtggg 


tctttaaaaa 


atcatacagc 


tcgcgcggat 


cgcaatccac 


atcggccaga 


tcgttattca 


ctaagctatt 


cgtataggga 


caatccgata 


ccgcatacag 


ctcgataatc 


ttttcagggc 


ggacgccatc 


ggcctcactc 


atgagcagat 


ggacctttgg 


aacaggcagc 


tttccttcca 


catcataggt 


ggtcccttta 


taccggctgt 


ctcccaccag 


cttatatacc 


ttagcaggag 


atttttcgat 


cagttttttc 


aattccggtg 


ttcctctttt 


ctacagtatt 


taaagatacc 


caattcactg 


ttccttgcat 


tctaaaacct 


gttttcaaag 


ttggcgtata 


acatagtatc 


ggtgatgctg 


ccaacttact 


gatttagtgt 


ttctgtgtct 


atcagctgtc 


cctcctgttc 


aagcaccgcc 


ggacatcagc 


gctatctctg 


gttcacttac 


accgcttctc 


aacccggtac 


tggcgttgga 


tgccgggcaa 


cagcccgcat 


tcacttaaaa 


aactcaggcc 


gcagtcggta 


atcgtctgcg 


cggaaatgga 


cgaacagtgg 


tggctgtttt 


acgcgtatga 


cagtctccgg 


cgcactatgg 


cgacgctggg 


gcgtcttatg 


tacrataacgg 


atagctagcc 


gctgtatgaa 


agcaagcgat 


atacgcagcg 


aattgagcgg 


cggctgggac 


ggaagtcgct 


gtcgttctca 


gggcattatc 


tgaacataaa 


acactatcaa 


aatttacaag 


ctataaggtt 


attgtcctgg 


tgctttgccc 


attctataga 


tatattgata 


tccttacata 


cggcgatatc 


ttctatataa 


atattcaagg 


caatctgcct 


cctcatcctc 
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ttcatcctct tcgtcttggt agctttttaa 
14880 

ggtccaattc tcgttttcat acctcggtat 
14940 

5 gggtttatcg cacccccgaa cacgagcacg 
15000 

aaggtaaaaa ttgccggccc cgccatgaag 
15060 

ggcaggccgc cacccaggcc gccgccctca 
10 15120 

gccagcacct gcggcacgtc aatgcttccg 
15180 

gttactgccc cgatcccggc aatggcaagg 
15240 

15 ccgttcgcgg ccgaggggcg cagcccctgg 
15294 
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atatggcgct tcatagagta attctgtaaa 
aatcttacct atcacctcaa atggttcgct 
gcacccgcga ccactatgcc aagaatgccc 
tccgtgaatg ccccgacggc cgaagtgaag 
ctgcccggca cctggtcgct gaatgtcgat 
ggcgtcgcgc tcgggctgat cgcccatccc 
actgccagcg ctgccatttt tggggtgagg 
ggggatggga ggcccgcgtt agcg 



<210> 16 
20 <211> 2019 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
25 <221> CDS 

<222> (1) . . (2010) 

<400> 16 

atg gcg gcg gca aca aca aca aca aca aca tct tct teg ate tec ttc 48 

30 Met Ala Ala Ala Thr Thr Thr Thr Thr Thr Ser Ser Ser lie Ser Phe 
15 10 15 

tec acc aaa cca tct cct tec tec tec aaa tea cca tta cca ate tec 96 
Ser Thr Lys Pro Ser Pro Ser Ser Ser Lys Ser Pro Leu Pro lie Ser 
35 20 25 30 

aga ttc tec etc cca ttc tec eta aac ccc aac aaa tea tec tec tec 144 
Arg Phe Ser Leu Pro Phe Ser Leu Asn Pro Asn Lys Ser Ser Ser Ser 
35 40 45 



40 



tec cgc cgc cgc ggt ate aaa tec age tct ccc tec tec ate tec gee 192 
Ser Arg Arg Arg Gly lie Lys Ser Ser Ser Pro Ser Ser He Ser Ala 
50 55 60 



45 gtg etc aac aca acc acc aat gtc aca acc act ccc tct cca acc aaa 240 
Val Leu Asn Thr Thr Thr Asn Val Thr Thr Thr Pro Ser Pro Thr Lys 
65 70 75 80 

cct acc aaa ccc gaa aca ttc ate tec cga ttc get cca gat caa ccc 2 88 
50 Pro Thr Lys Pro Glu Thr Phe He Ser Arg Phe Ala Pro Asp Gin Pro 

85 90 95 

cgc aaa ggc get gat att etc gtc gag get tta gaa cgt caa ggc gta 33 6 
Arg Lys Gly Ala Asp He Leu Val Glu Ala Leu Glu Arg Gin Gly Val 
55 100 105 110 

gaa acc gta ttc get tac cct gga ggt gca tea atg gag att cac caa 3 84 

Glu Thr Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu He His Gin 
115 120 125 

60 



gee tta acc cgc tct tec tea ate cgt aac gtc ctt cct cgt cac gaa 



432 
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Ala Leu Thr Arg Ser Ser Ser lie Arg Asn Val Leu Pro Arg His Glu 

130 135 140 

caa gga ggt gta ttc gca gca gaa gga tac get cga tec tea ggt aaa 480 

5 Gin Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ser Ser Gly Lys 

145 150 155 160 

cca ggt ate tgt ata gec act tea ggt ccc gga get aca aat etc gtt 52 8 

Pro Gly He Cys He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 

10 165 170 175 

age gga tta gee gat gcg ttg tta gat agt gtt cct ctt gta gca ate 576 

Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Val Pro Leu Val Ala He 

180 185 190 



15 



35 



aca gga caa gtc cct cgt cgt atg att ggt aca gat gcg ttt caa gag 624 
Thr Gly Gin Val Pro Arg Arg Met He Gly Thr Asp Ala Phe Gin Glu 
195 200 205 



20 act ccg att gtt gag gta acg cgt teg att acg aag cat aac tat ctt 672 
Thr Pro He Val Glu Val Thr Arg Ser He Thr Lys His Asn Tyr Leu 
210 215 220 

gtg atg gat gtt gaa gat att cct agg att att gag gag get ttc ttt 720 
25 Val Met Asp Val Glu Asp He Pro Arg He He Glu Glu Ala Phe Phe 
225 230 235 240 

tta get act tct ggt aga cct gga cct gtt ttg gtt gat gtt cct aaa 768 
Leu Ala Thr Ser Gly Arg Pro Gly Pro Val Leu Val Asp Val Pro Lys 
30 245 250 255 

gat att caa caa cag ctt gcg att cct aat tgg gaa cag get atg aga 816 
Asp He Gin Gin Gin Leu Ala He Pro Asn Trp Glu Gin Ala Met Arg 
260 265 270 



tta cct ggt tat atg tct agg atg cct aaa cct ccg gaa gat tct cat 864 
Leu Pro Gly Tyr Met Ser Arg Met Pro Lys Pro Pro Glu Asp Ser His 
275 280 285 



40 ttg gag cag att gtt agg ttg att tct gag tct aag aag cct gtg ttg 912 
Leu Glu Gin He Val Arg Leu He Ser Glu Ser Lys Lys Pro Val Leu 
290 295 300 

tat gtt ggt ggt ggt tgt ttg aac tct age gat gaa ttg ggt agg ttt 960 
45 Tyr Val Gly Gly Gly Cys Leu Asn Ser Ser Asp Glu Leu Gly Arg Phe 
305 310 315 320 

gtt gag ctt acg gga ate cct gtt gcg agt acg ttg atg ggg ctg gga 1008 
Val Glu Leu Thr Gly He Pro Val Ala Ser Thr Leu Met Gly Leu Gly 
50 325 330 335 

tct tat cct tgt gat gat gag ttg teg tta cat atg ctt gga atg cat 1056 
Ser Tyr Pro Cys Asp Asp Glu Leu Ser Leu His Met Leu Gly Met His 
340 345 350 

55 

ggg act gtg tat gca aat tac get gtg gag cat agt gat ttg ttg ttg 1104 

Gly Thr Val Tyr Ala Asn Tyr Ala Val Glu His Ser Asp Leu Leu Leu 
355 360 365 

60 gcg ttt ggg gta agg ttt gat gat cgt gtc acg ggt aaa ctt gag get 1152 
Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys Leu Glu Ala 



10 
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370 375 380 

ttt get agt agg get aag att gtt cat att gat att gac teg get gag 1200 
Phe Ala Ser Arg Ala Lys lie Val His lie Asp lie Asp Ser Ala Glu 
385 390 395 400 

att ggg aag aat aag act cct cat gtg tct gtg tgt ggt gat gtt aag 1248 
lie Gly Lys Asn Lys Thr Pro His Val Ser Val Cys Gly Asp Val Lys 
405 410 415 

ctg get ttg caa ggg atg aat aag gtt ctt gag aac cga gcg gag gag 1296 
Leu Ala Leu Gin Gly Met Asn Lys Val Leu Glu Asn Arg Ala Glu Glu 
420 425 430 



15 ctt aaa ctt gat ttt gga gtt tgg agg aat gag ttg aac gta cag aaa 1344 
Leu Lys Leu Asp Phe Gly Val Trp Arg Asn Glu Leu Asn Val Gin Lys 
435 440 445 

cag aag ttt ccg ttg age ttt aag acg ttt ggg gaa get att cct cca 1392 
20 Gin Lys Phe Pro Leu Ser Phe Lys Thr Phe Gly Glu Ala lie Pro Pro 
450 455 460 



cag tat gcg att aag gtc ctt gat gag ttg act gat gga aaa gec ata 
Gin Tyr Ala lie Lys Val Leu Asp Glu Leu Thr Asp Gly Lys Ala lie 
25 465 470 475 480 



30 



50 



1440 



ata agt act ggt gtc ggg caa cat caa atg tgg gcg gcg cag ttc tac 1488 
lie Ser Thr Gly Val Gly Gin His Gin Met Trp Ala Ala Gin Phe Tyr 
485 490 495 

aat tac aag aaa cca agg cag tgg eta tea tea gga ggc ctt gga get 153 6 
Asn Tyr Lys Lys Pro Arg Gin Trp Leu Ser Ser Gly Gly Leu Gly Ala 
500 505 510 

35 atg gga ttt gga ctt cct get gcg att gga gcg tct gtt get aac cct 1584 
Met Gly Phe Gly Leu Pro Ala Ala lie Gly Ala Ser Val Ala Asn Pro 
515 520 525 

gat gcg ata gtt gtg gat att gac gga gat gga agt ttt ata atg aat 163 2 
40 Asp Ala lie Val Val Asp lie Asp Gly Asp Gly Ser Phe lie Met Asn 
530 535 540 

gtg caa gag eta gee act att cgt gta gag aat ctt cca gtg aag gta 1680 
Val Gin Glu Leu Ala Thr lie Arg Val Glu Asn Leu Pro Val Lys Val 
45 545 550 555 560 

ctt tta tta aac aac cag cat ctt ggc atg gtt atg caa tgg gaa gat 1728 
Leu Leu Leu Asn Asn Gin His Leu Gly Met Val Met Gin Trp Glu Asp 
565 570 575 



egg ttc tac aaa get aac cga gca cac aca ttt etc gga gat ccg get 1776 
Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Phe Leu Gly Asp Pro Ala 
580 585 590 



55 cag gag gac gag ata ttc ccg aac atg ttg ctg ttt gca gca get tgc 1824 
Gin Glu Asp Glu lie Phe Pro Asn Met Leu Leu Phe Ala Ala Ala Cys 
595 600 605 

ggg att cca gcg gcg agg gtg aca aag aaa gca gat etc cga gaa get 1872 
60 Gly lie Pro Ala Ala Arg Val Thr Lys Lys Ala Asp Leu Arg Glu Ala 
610 615 620 
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att cag aca atg ctg gat aca cca gga cct tac ctg ttg gat gtg att 192 0 
He Gin Thr Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu Asp Val He 
625 630 635 640 

5 

tgt ccg cac caa gaa cat gtg ttg ccg atg ate ccg aat ggt ggc act 1968 
Cys Pro His Gin Glu His Val Leu Pro Met He Pro Asn Gly Gly Thr 
645 650 655 

10 ttc aac gat gtc ata acg gaa gga gat ggc egg att aaa tac tgagagctc 2019 
Phe Asn Asp Val He Thr Glu Gly Asp Gly Arg He Lys Tyr 
660 665 670 

15 <210> 17 
<211> 670 
<212> PRT 

<213> Arabidopsis thaliana 
20 <400> 17 

Met Ala Ala Ala Thr Thr Thr Thr Thr Thr Ser Ser Ser He Ser Phe 
15 10 15 

Ser Thr Lys Pro Ser Pro Ser Ser Ser Lys Ser Pro Leu Pro He Ser 
25 ** 20 25 30 

Arg Phe Ser Leu Pro Phe Ser Leu Asn Pro Asn Lys Ser Ser Ser Ser 
35 40 45 

30 Ser Arg Arg Arg Gly He Lys Ser Ser Ser Pro Ser Ser He Ser Ala 
50 55 60 



35 



Val Leu Asn Thr Thr Thr Asn Val Thr Thr Thr Pro Ser Pro Thr Lys 

65 70 75 80 

Pro Thr Lys Pro Glu Thr Phe He Ser Arg Phe Ala Pro Asp Gin Pro 

85 90 95 



Arg Lys Gly Ala Asp He Leu Val Glu Ala Leu Glu Arg Gin Gly Val 
40 100 105 110 



Glu Thr Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu He His Gin 
115 120 125 

45 Ala Leu Thr Arg Ser Ser Ser He Arg Asn Val Leu Pro Arg His Glu 
130 135 140 

Gin Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ser Ser Gly Lys 
145 150 155 160 



50 



Pro Gly He Cys He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
165 170 175 



Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Val Pro Leu Val Ala He 
55 180 185 190 

Thr Gly Gin Val Pro Arg Arg Met He Gly Thr Asp Ala Phe Gin Glu 
195 200 205 



60 Thr Pro He Val Glu Val Thr Arg Ser He Thr Lys His Asn Tyr Leu 
210 215 220 
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Val Met Asp Val Glu Asp lie Pro Arg lie lie Glu Glu Ala Phe Phe 
225 230 235 240 

5 Leu Ala Thr Ser Gly Arg Pro Gly Pro Val Leu Val Asp Val Pro Lys 

245 250 255 



10 



25 



40 



Asp lie Gin Gin Gin Leu Ala lie Pro Asn Trp Glu Gin Ala Met Arg 
260 265 270 

Leu Pro Gly Tyr Met Ser Arg Met Pro Lys Pro Pro Glu Asp Ser His 
275 280 285 



Leu Glu Gin lie Val Arg Leu lie Ser Glu Ser Lys Lys Pro Val Leu 
15 290 295 300 

Tyr Val Gly Gly Gly Cys Leu Asn Ser Ser Asp Glu Leu Gly Arg Phe 
305 310 315 320 

20 Val Glu Leu Thr Gly lie Pro Val Ala Ser Thr Leu Met Gly Leu Gly 

325 330 335 



55 



Ser Tyr Pro Cys Asp Asp Glu Leu Ser Leu His Met Leu Gly Met His 
340 345 350 

Gly Thr Val Tyr Ala Asn Tyr Ala Val Glu His Ser Asp Leu Leu Leu 
355 360 365 



Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys Leu Glu Ala 
30 370 375 380 

Phe Ala Ser Arg Ala Lys lie Val His lie Asp lie Asp Ser Ala Glu 
385 390 395 400 

35 lie Gly Lys Asn Lys Thr Pro His Val Ser Val Cys Gly Asp Val Lys 

405 410 415 



Leu Ala Leu Gin Gly Met Asn Lys Val Leu Glu Asn Arg Ala Glu Glu 
420 425 430 

Leu Lys Leu Asp Phe Gly Val Trp Arg Asn Glu Leu Asn Val Gin Lys 
435 440 445 



Gin Lys Phe Pro Leu Ser Phe Lys Thr Phe Gly Glu Ala lie Pro Pro 
45 450 455 460 

Gin Tyr Ala lie Lys Val Leu Asp Glu Leu Thr Asp Gly Lys Ala lie 

465 470 475 480 

50 lie Ser Thr Gly Val Gly Gin His Gin Met Trp Ala Ala Gin Phe Tyr 

485 490 495 



Asn Tyr Lys Lys Pro Arg Gin Trp Leu Ser Ser Gly Gly Leu Gly Ala 
500 505 510 

Met Gly Phe Gly Leu Pro Ala Ala lie Gly Ala Ser Val Ala Asn Pro 
515 520 525 



Asp Ala He Val Val Asp He Asp Gly Asp Gly Ser Phe He Met Asn 
60 530 535 540 
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Val Gin Glu Leu Ala Thr lie Arg Val Glu Asn Leu Pro Val Lys Val 
545 550 555 560 

Leu Leu Leu Asn Asn Gin His Leu Gly Met Val Met Gin Trp Glu Asp 
5 565 570 575 

Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Phe Leu Gly Asp Pro Ala 
580 585 590 

10 Gin Glu Asp Glu lie Phe Pro Asn Met Leu Leu Phe Ala Ala Ala Cys 
595 600 605 

Gly lie Pro Ala Ala Arg Val Thr Lys Lys Ala Asp Leu Arg Glu Ala 
610 615 620 

15 

He Gin Thr Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu Asp Val He 
625 630 635 640 

Cys Pro His Gin Glu His Val Leu Pro Met lie Pro Asn Gly Gly Thr 
20 645 650 655 

Phe Asn Asp Val He Thr Glu Gly Asp Gly Arg He Lys Tyr 
660 665 670 

25 

<210> 18 
<211> 259 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<221> promoter 
<222> (1) . . (259) 

35 

<220> 

<223> Description of Artificial Sequence : be2promoter 
fragment 

40 <400> 18 

gatctctaaa taattcgaaa tatctttgtt attatttttt tctattcaaa ttgcaattag 60 
acataagtca ttttaactga agctgcattg atgaaaaatt atactatgtc tttatgtata 120 
tatattaatg ttttaaattc ctttatagtg ataaagatgg ttcgaaacat gctacaaatt 180 
attatacgaa gttacttttt ttaatctact ttaacaattt tctaatttca ctattgaaca 24 0 
45 tagataccag cccgggccg 259 



<210> 19 

<211> 400 

50 <212> DNA 

<213> Artificial Sequence 



55 



<220> 

<223> Description of Artificial Sequence :RNAi 4 2 0be2bel 
<220> 



<400> 19 

gaattgttgt tctcatggac atcgttcaca gccatgcatc aaataatact ttagatggac 60 

60 tgaacatgtt tgacggcacc gatagttgtt actttcactc tggagctcgt ggttatcatt 12 0 

ggatgtggga ttcccgcctc tttaactatg gaaactggga ggtacttagg tatcttctct 180 
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caaatgcgag atggtggttg ccatttcaca 
acaaatttca atggtcgtcc aaattccttc 
gcttattaca gagttgacga acgcatgtca 
agtgagctac taccaacagc caatatcgag 



42 

tcaccagaag gaatacctgg agttccagaa 240 
aaagtgctgt ctcctgcgcg aacatgtgtg 3 00 
gaaactgaag tttaccagac agacatttct 3 60 
gagagtgacg 400 



<210> 20 

<211> 1105 

<212> DNA 

10 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : RNAi fragment 



15 <220> 



<400> 20 

gaattgttgt tctcatggac atcgttcaca 
tgaacatgtt tgacggcacc gatagttgtt 

20 ggatgtggga ttcccgcctc tttaactatg 
caaatgcgag atggtggttg ccatttcaca 
acaaatttca atggtcgtcc aaattccttc 
gcttattaca gagttgacga acgcatgtca 
agtgagctac taccaacagc caatatcgag 

25 attcgaaata tctttgttat tatttttttc 
ttaactgaag ttgcattgat gaaaaattat 
aaattccttt atagtgataa agatagttcg 
cgttactttt tttaatctac tttaacaatt 
gcccgggccg tcgacctcga attcgccctt 

30 tattggctgt tggtagtagc tcactagaaa 
tgcgttcgtc aactctgtaa taagccacac 
aatttggacg accattgaaa tttgtttctg 
aatggcaacc accatctcgc atttgagaga 
ttaaagaggc gggaatccca catccaatga 

35 ctatcggtgc cgtcaaacat gttcagtcca 
acgatgtcca tgagaacaac aattc 



gccatgcatc aaataatact ttagatggac 60 
actttcactc tggagctcgt ggttatcatt 120 
gaaactggga ggtacttagg tatcttctct 180 
tcaccagaag gaatacctgg agttccagaa 240 
aaagtgctgt ctcctgcgcg aacatgtgtg 3 00 
gaaactgaag tttaccagac agacatttct 3 60 
gagagtgacg atcaagctga tctctaaata 42 0 
tattcaaatt gcaattagac ataagtcatt 480 
actatgtttt atgtatatat attaattttt 540 
aaacatgcta taaattatta tacgaattta 600 
ttctaatttc actattgaac atagatacca 660 
ggagagtgac gttcgcgtca ctctcctcga 72 0 
tgtctgtctg gtaaacttca gtttctgaca 780 
atgttcgcgc aggagacagc actttgaagg 840 
gaactccagg tattccttct ggtgatgtga 90 0 
agatacctaa gtacctccca gtttccatag 960 
taaccacgag ctccagagtg aaagtaacaa 102 0 
tctaaagtat tatttgatgc atggctgtga 1080 

1105 



<210> 21 
40 <211> 180 
<212> DNA 

<213> Artificial Sequence 



<220> 

45 <223> Description of Artificial Sequence :SBE RNAi 1 



<220> 



<400> 21 

50 actagtggta cttaggtatc ttctctcaaa 
aacagccaat atcgaggaga gtgacgttcg 
gtagctcact caaccaccat ctcgcatttg 



tgcgagatgg tggttgagtg agctactacc 60 
cgtcactctc ctcgatattg gctgttggta 12 0 
agagaagata cctaagtacc ttttggtacc 180 



55 <210> 22 
<211> 420 
<212> DNA 
<213> Artificial 



Sequence 



60 



<220> 

<223> Description of Artificial Sequence :SBE RNAi 2 



WO 2004/078983 PCT/EP2004/002096 

43 



<400> 22 

actagttgga gctcgtggtt atcattggat 
ctgggaggta cttaggtatc ttctctcaaa 
5 tgacgaacgc atgtcagaaa ctgaagttta 
aacagccaat atcgaggaga gtgacgttcg 
gtagctcact agaaatgtct gtctggtaaa 
tgtaataagc caaccaccat ctcgcatttg 
catagttaaa gaggcgggaa tcccacatcc 



gtgggattcc cgcctcttta actatggaaa 60 
tgcgagatgg tggttggctt attacagagt 120 
ccagacagac atttctagtg agctactacc 180 
cgtcactctc ctcgatattg gctgttggta 240 
cttcagtttc tgacatgcgt tcgtcaactc 3 00 
agagaagata cctaagtacc tcccagtttc 3 60 
aatgataacc acgagctcca ttttggtacc 420 



<210> 23 
<211> 37 
<212> DNA 
15 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : spacer 



20 <220> 



<400> 23 

atcaagctta tcgataccgt cgacctcgaa gcttgat 37 

25 

<210> 24 
<211> 837 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> Description of Artificial Sequence : fragment of be2 
and bel in pHAS3 for RNAi 

35 <220> 

<400> 24 

gaattgttgt tctcatggac atcgttcaca gccatgcatc aaataatact ttagatggac 60 
tgaacatgtt tgacggcacc gatagttgtt actttcactc tggagctcgt ggttatcatt 12 0 

40 ggatgtggga ttcccgcctc tttaactatg gaaactggga ggtacttagg tatcttctct 180 
caaatgcgag atggtggttg ccatttcaca tcaccagaag gaatacctgg agttccagaa 240 
acaaatttca atggtcgtcc aaattccttc aaagtgctgt ctcctgcgcg aacatgtgtg 3 00 
gcttattaca gagttgacga acgcatgtca gaaactgaag tttaccagac agacatttct 360 
agtgagctac taccaacagc caatatcgag gagagtgacg atcaagctta tcgataccgt 420 

45 cgacctcgaa gcttgatcgt cactctcctc gatattggct gttggtagta gctcactaga 480 
aatgtctgtc tggtaaactt cagtttctga catgcgttcg tcaactctgt aataagccac 540 
acatgttcgc gcaggagaca gcactttgaa ggaatttgga cgaccattga aatttgtttc 600 
tggaactcca ggtattcctt ctggtgatgt gaaatggcaa ccaccatctc gcatttgaga 660 
gaagatacct aagtacctcc cagtttccat agttaaagag gcgggaatcc cacatccaat 72 0 

50 gataaccacg agctccagag tgaaagtaac aactatcggt gccgtcaaac atgttcagtc 780 
catctaaagt attatttgat gcatggctgt gaacgatgtc catgagaaca acaattc 837 



